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LEAFY COTYLEDON 1 GENES AND THEIR USES 



CROSS REFERENCE TO RELATED APPLICATIONS 
The present application is a Continuation-ln-Part ("CIP") of United States 
Patent Application Serial Number (USSN) 09/193,93 1 , filed November 17, 1998, which is a 
CIP of USSN 09/103,478, filed June 24, 1998, which is a CIP of USSN 09/026,221 , filed 
February 19, 1998, which is a CIP of USSN 08/804,534, filed February 21 , 1997. Each of 
the aforementioned applications is explicitly incorporated herein by reference in their entirety 
and for all purposes. 

FIELD OF THE INVENTION 
The present invention is directed to plant genetic engineering. In particular, it 
relates to new embryo-specific genes useful in improving agronomically important plams. 

BACKGROUND OF THE INVENTION 
Embryogenesis in higher plants is a critical stage of the plant life cycle in 
which the primary organs are established. Embryo development can be separated into two 
main phases: the early phase in which the primary body organization of the embryo is laid 
down and the late phase which involves maturation, desiccation and dormancy. In the early 
phase, the symmetry of the embryo changes from radial to bilateral, giving rise to a hypocotyl 
with a shoot meristem surrounded by the two cotyledonary primordia at the apical pole and a 
root meristem at the basal pole. In the late phase, during maturation the embryo achieves its 
maximum size and the seed accumulates storage proteins and lipids. Maturation is ended by 
the desiccation stage in which the seed water contem decreases rapidly and the embryo passes 
into metabolic quiescent state. Dormancy ends with seed germination, and development 
continues from the shoot and the root meristem regions. 

The precise regulatory mechanisms which control cell and organ 
differentiation during the initial phase of embryogenesis are largely unknown. The plant 
hormone abscisic acid (ABA) is thought to play a role during late embryogenesis, mainly in 
the maturation stage by inhibiting germination during embryogenesis (Black, M. (1991). In 
Abscisic Acid: Physiology and Biochemistry. W. J. Davies and H. G. Jones, eds. (Oxford: 
Bios Scientific Publishers Ltd.), pp. 99-124) Koornneef, M., and Karssen. C. M. (1994). In 
Arahidopsis. E. M. Meyerowitz and C. R. Sommerxalle. eds. (Cold Spring Harbor: Cold 
Sprmg Harbor Laboratory Press), pp. 3 1 3-334). Mutations which effect seed development 



and are ABA insensitive have been identified in Arabidopsis and maize. The ABA 
insensitive (abi3) mutant of Arabidopsis and the viviparousl (vpl ) mutant of maize are 
detected mainly during late embryogenesis (McCarty, et a!., (1989) Plant Cel\ 1, 523-532 and 
Parcy et a!., (1994) Plant Cell 6, 1567-1 582). Both the VPl gene and the AB13 genes have 
been isolated and were found to share conserved regions (Giraudat, J. (1995) Current 
Opinion in Cell Biology 7:232-238 and McCarty, D. R. (1995). Annu. Rev. Plant Physiol. 
Plant Mol. Biol. 46:71-93). The VPl gene has been shown to function as a transcription 
activator (McCarty, et ai, (1991) Cell 66:895-906). It has been suggested that AB13 has a 
similar function. 

Another class of embryo defective mutants involves three genes: LEAFY 
COTYLEDONl and 2 (LECl, LEC2) and FUSCA3 (FUS3). These genes are thought to 
play a central role in late embr>'ogenesis (Baumlein, et al. (1994) Plant J. 6:379-387; Meinke, 
D. W. (1992) Science 258:1647-1650; Meinke et al.. Plant Cell 6:1049-1064; West et al., 
(1994) Plant Cell 6:1731-1745). Like the abi3 mutant, leafy cotyledon-type mutants are 
defective in late embryogenesis. In these mutants, seed morphology is altered, the shoot 
meristcm is activated early, storage proteins are lacking and developing cotyledons 
accumulate anthocyanin. As with abi3 mutants, they are desiccation intolerant and therefore 
die during late embryogenesis. Nevertheless, the immature mutants embryos can be rescued 
to give rise to mature and fertile plants. However, unlike abi3 when the immature mutants 
germinate they exhibit trichomes on the adaxial surface of the cotyledon. Trichomes are 
normally present only on leaves, stems and sepals, not cotyledons. Therefore, it is thought 
that the leafy cotyledon type genes have a role in specifying cotyledon identity during 
embryo development. 

Among the above mutants, the led mutant exhibits the most extreme 
phenotype during embryogenesis. For example, the maturation and postgermination 
programs are active simultaneously in the led mutant (West et al., 1994), suggesting a 
critical role for LECl in gene regulation during late embryogenesis. 

In spite of the recent progress in defining the genetic control of embryo 
development, further progress is required in the identification and analysis of genes expressed 
specifically in the embryo and seed. Characterization of such genes would allow for the 
genetic engineering plants with a variety of desirable traits. For instance, modulation of the 
expression of genes which control embryo development may be used to alter traits such as 
accumulation of storage proteins in leaves and cotyledons. Alternatively, promoters from 



embryo or seed-specific genes can be used to direct expression of desirable heterologous 
genes to the embryo or seed. The present invention addresses these and other needs. 

SUMMARY OF THE INVENTION 

5 The present invention is based, in part, on the isolation and characterization of 

LECl genes. The invention provides isolated nucleic acid molecules comprising a LECl 
polynucleotide sequence which is at least 68% identical to the B domain of SEQ ID NO:2. 

The invention also provides expression cassettes comprising a promoter 
operably linked to a heterologous polynucleotide sequence or complement thereof, encoding 

1 0 a LEG 1 polypeptide comprising a sequence which is at least 68% identical to the B domain of 
SEQ ID NO:2. In some embodiments, the polynucleotide sequence is heterologous to any 
element in the expression cassette. In a preferred embodiment, the B domain comprises a 
polypeptide between about amino acid residue 28 and amino acid residue 11 7 of SEQ ID 
NO:2. In a more preferred embodiment, the B domain comprises a polypeptide sequence 

1 5 with an amino terminus at amino acid residues 28-35 and a carboxy terminus at amino acid 
residues 103-1 17 of SEQ IDNO:2. 

In particularly preferred embodiments, the LECl polypeptide is shown in SEQ 
ID NO:20 or 22. Such LEC l polypeptides can be encoded by the polynucleotide sequences 
shown in SEQ ID NO: 1 9 or SEQ ID NO:2 1 , respectively. In another embodiment, the LEC 1 

20 polypeptide is a fusion between two or more LEC 1 polypeptides of polypeptide 
subsequences. 

the expression cassette comprises a promoter operably linked to the LECl 
polynucleotide or Hs complement. For example, the promoter can be a constitutive promoter. 
Alternatively, the prciYnoter can be a promoter from a LECl gene. For instance, the LECl 
25 promoter can be from about nucleotide 1 to about nucleotide 1 998 of SEQ ID NO:3. In one 
embodiment, the heterologous polynucleotide can be linked to the promoter in the antisense 
orientation. In another embodhjient, the promoter is SEQ ID NO:23. The promoter can 
further compromise SEQ ID NO:^4. 

In another embodiment, the invention provides an expression cassette 
30 comprising a promoter operably linked to a heterologous polynucleotide sequence, or 

complemem thereof, encoding a LECl polypeptide comprising a subsequence at least 90% 
identical to the A or C domain of a LECl polypeptide. The polynucleotide sequence can be 
heterologous to any element in the expression cassette. Such expression cassettes can encode 
fusions of two or more LECl polypeptides or polypeptide subsequences. 
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The invention also provides for an expression cassette for the expression of 
heterologous polypeptides in a plant. The expression cassette comprises a LECl promoter 
operably linked to a heterologous polynucleotide. In some embodiments, the LECl promoter 
is at least 70% identical to SEQ ID NO:23. In some embodiments, the expression cassette 
promoter comprises a promoter at least 70% identical to SEQ ID NO:24. Preferably, the 
promoter comprises the sequence displayed in SEQ ID NO:24. 

The invention also provides an isolated nucleic acid or complement thereof, 
encoding a LECl polypeptide comprising a subsequence at least 68% identical to the B 
domain of SEQ ID NO:2, with the proviso that the nucleic acid is not clone MNJ7. In a 
preferred embodiment, the B domain comprises a polypeptide sequence with an amino 
terminus at amino acids 28-35 and a carboxy terminus at amino acids 103-1 17 of SEQ ID 
NO:2. m another embodiment, the LECl polypeptide is shown in SEQ ID NO: 20 or SEQ 
ID NO:22. Such LECl polypeptides can be encoded by the polynucleotide sequences shown 
in SEQ ID NO:19 or SEQ ID NO:21, respectively. In another embodiment, the LECl 
polypeptide is a fusion between two or more LECl polypeptides of polypeptide 
subsequences. 

The isolated nucleic acid can further compromise a promoter operably linked 
to the LECl -encoding nucleic acid. The promoter can be a constitutive promoter. 
Alternatively- the promoter can be a promoter from a LECl gene. For instance, the LECl 
promoter can be from about nucleotide 1 to about nucleotide 1998 of SEQ ID N0:3. In one 
embodiment, the heterologous polynucleotide can be linked to the promoter in the antisense 
orientation. 

The invention provides a host cell comprising expression cassettes or nucleic 
acids of the invention. Thus, in one embodiment, the host cells of the invention comprise an 
expression cassette comprising a promoter operably linked to a heterologous a polynucleotide 
sequence, or complement thereof, encoding a LECl polypeptide with a subsequence at least 
68% identical to the B domain of SEQ ID NO:2. In other embodiments, the host ceil of the 
invention comprises an expression cassette comprising a promoter operably linked to a 
heterologous polynucleotide sequence or complement thereof, encoding a LECl polypeptide 
with a subsequence at least 90% identical to the A or C domain of a LECl polypeptide. 
Other embodiments include hosts cells comprising an expression cassette comprising a 
promoter at least 70% identical to SEQ ID NO;23 or an isolated nucleic acid comprising a 
subsequence at least 68% identical to the B domain of SEQ ID NO:2, so long as the nucleic 
acid is not clone MNJ7. 
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The invention also provides isolated polypeptides comprising amino acid 
sequences at least 68% identical to the B domain of SEQ ID NO:2 and capable of exhibiting 
at least one of the biological activities of the polypeptides encoded in SEQ ID NO: 1, SEQ ID 
NO: 19 or SEQ ID NO:21 , or a fragment thereof. Antibodies capable of binding the above- 
described polypeptide are also provided. 

Also provided are methods of introducmg an isolated nucleic acid into a host 
cell The method comprises providing an expression cassette of nucleic acid of the invemion 
as described herein and contacting the expression cassette or nucleic acid with the host cell 
under conditions that permit insertion of the nucleic acid into the host cell. 

The invention also provides transgenic plant cells or plants comprising an 
expression cassette comprising a promoter operably linked to a heterologous polynucleotide 
sequence, or complemem thereof, encoding a LECl polypeptide comprising a sequence 
which is at least 68O/0 identical to the B domam of SEQ ID NO:2. In a preferred embodiment, 
the LECl polypeptide is shown in SEQ ID NO: 20 or SEQ ID NO:22. Such LECl 
polypeptides can be encoded by the polynucleotide sequences shown in SEQ ID NO:19 or 
SEQ ID NO:21, respectively. The invention also provides plants that are regenerated from 

the plant cells discussed above. 

The expression cassette promoter can be a constitutive promoter. 
Alternatively, the promoter can be a promoter from a LECl gene. For instance, the LECl 
promoter can be from about nucleotide 1 to about nucleotide 1998 of SEQ ID NO:3. In one 
embodiment, the heterologous polynucleotide can be linked to the promoter in the antisense 
orientation. In another embodiment, the promoter is SEQ ID NO:23. The promoter can also 

further comprise SEQ ID NO:24. 

The invention also provides an expression cassette for the expression of a 
heterologous polynucleotide in a plant cell, comprising a promoter polynucleotide at least 
70O/O identical to SEQ ID NO:23, wherein the promoter polynucleotide is operably linked to a 
heterologous polynucleotide. In one embodiment, the promoter polynucleotide is SEQ ID 
NO:23. The promoter can also further comprise a polynucleotide at least 70% identical to 
SEQ ID NO:24. In a preferred embodiment the promoter comprises SEQ ID NO:24. 

The invention also provides methods of modulating transcription comprising, 
introducing into the plant an expression cassette containing a plant promoter operably linked 
to a heterologous LECl polynucleotide, the heterologous LECl polynucleotide encoding a 
LECl polypeptide composing a subsequence at least 68% identical to the B domain of SEQ 
,D NO-2 and detecting a plant with modulated transcription. Embodimems of these methods 
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i„e.ud= where ,he LEC, pCynuCeodde is SEQ ,D N0:2. SEO .D NO:20 or SEQ ,D N0.2 • 
,„„,he, =m.oai™e„,s, the LEC, polypeptides are encoded hy SEQ ,D NO:,, SEQ ,D NQ:, , 
or SEQ ,D NO-2, . Preferred embodiments of the invention include the method where 
transcrtption moduiation resuhs in tndnetion ofembyonic eharaetensties in a piant. ,n an 
ahernattve embodintent, transcription modulation results in .ndue.ion of seed development. 
The invention also provides a method of detecting a nuCeic acid ,n a samp,=. , he 
method comprises providing an isolated EEC, nuclcc acid molecule comprtstng a 
polynucleotide sequence, or complement thereof, encoding a LEC, po,ypeptide „Hh a 
Lscquence at least 68o/. tdentical to the B domain of SEQ ID N0:2., eontacttn, the tsoiated 
nucleic acid molecule with a sample under conditions which permit a comparison of t e 
sequence of the isolated nucleic actd molecule with the sequence of DNA in the sample, and 
analyzing the result of the comparison. In some embodiments, the isolated nucleic acd 
molecule and the sample are contacted under conditions wh.ch permit the forma.ton o, a 
duplex between complementary nucleic acid sequences. 

Definitions 

The phrase "nuclete acid" refers to a single or double-stranded polymer of 
deoxyribonucleotide or nbonucleotide bases read from the 5' to the y end. Nuclete act<.s may 
also tnCude modtfted nuCeotides that permit correct read tiuough by a polymerase and do 
not alter expression of a polypeptide encoded by that nucleic actd. 

The phrase "polynucleotide sequence" or -nucleic acid sequence .nCudes 
both the sense and amtsense strands of a nuCeic aerd as either tndtvidual single strands or ,n 
the duplex. It includes, bu, is no. limited to, self-repltcating piasmids, chromosomal 
sequences, and infectious polymers of DNA or RN A. 

The phrase "nucleic acid sequence encoding" refers to a nucle.c acd whtch 
, directs the expression of a spee.f.c protetn or peptide. The nucleic acid sequences include 
both the DNA strand sequence that is transcnbed into RNA and the RNA sequence that ,s 
translated into protein. The nucle.c acid sequences include both the full length nucletc ac.d 
sequences as well as non-full length sequences derived from the full length sequences. ., 
should be further understood that the sequence includes the degenerate codons of the nauve 
sequence or sequences whtch may he tntroduced to provide codon preference ,n a spectfic 
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host cell. 



The term "promoter" refers to a regton or sequence determtnants located 
upstream or downstream from the star, of transcr.ption and which are .nvolved m recognit.on 
and bindtng of RNA polymerase and other proteins to initiate transeript.on. A plant 



promoter" is a promoter capable of initiating transcription m plant cells. Such promoters 
need not be of plant origin, for example, promoters derived from plam viruses, such as the 
CaMV35S promoter, can be used in the present invention. 

The term "plant" includes whole plants, shoot vegetative organs/structures 
(. , leaves, stems and tubers), roots, flowers and floral organs/structures (e.g. bracts, sepals, 
petals, stamens, carpels, anthers and ovules), seed (mcluding embryo, endosperm, and seed 
coat) and fruit (the mature ovary), plant tissue (e.g. vascular tissue, ground tissue, and the 
like) and cells (e.g. guard cells, egg cells, trichomes and the Uke), and progeny of same. The 
class of plants that can be used in the method of the invention is generally as broad as the 
class of higher and lower plants amenable to transformation techniques, including 
angiosperms (monocotyledonous and dicotyledonous plants), gymnosperms, ferns, and 
multicellular algae. It includes plants of a variety of ploidy levels, including aneuploid, 
polyploid, diploid, haploid and hemizygous. 

A polynucleotide sequence is "heterologous to" an organism or a second 
polynucleotide sequence if it originates from a foreign species, or, if from the same species, is 
modified from its original form. For example, a promoter operably linked to a heterologous 
coding sequence refers to a coding sequence from a species different from that from which 
the promoter was derived, or, if from the same species, a coding sequence which is different 
from any naturally occurring allelic variants. As defined here, a modified LECl coding 
sequence which is heterologous to an operably linked LECl promoter does not include the T- 
DNA insertional mutants as described in West et al., Tke Plant Cell 6:173 1-1745 (1994). 

A polvnucleotide "exogenous to" an individual plant is a polynucleotide which 
is introduced into the plant by any means other than by a sexual cross. Examples of means 
bv which this can be accomplished are described below, and include Agrobacterium- 
mediated transformation, biolistic methods, electroporation, in planta techniqties, and the 
like Such a plant containing the exogenous nucleic acid is referred to here as an R, 
generation transgenic plant. Transgenic plants which arise from sexual cross or by selfing are 

descendants of such a plant. 

As used herein an "embryo-specific gene" or "seed specific gene" is a gene 
that is preferentially expressed during embr>'o development in a plant. For purposes of this 
disclosure, embryo development begins with the first cell divisions in the zygote and 
continues through the late phase of embryo development (characterized by maturation, 
desiccation, dormancy), and ends with the production of a mature and desiccated seed. 
Fmbrvo-specific genes can be further classified as "early phase-specfic'- and "late phase- 



specific'' Earl V phase-speofic genes are those expressed in embryos up to the end of embryo 
morphogenesis. Late phase-specific genes are those expressed from maturation through to 
production of a mature and desiccated seed. 

A "LECl polynucleotide" is a nucleic acid sequence comprising (or consistmg 
of) a coding region of about 100 to about 900 nucleotides, sometimes from about 300 to 
about 630 nucleotides, which hybnd.zes to SEQ ID N0:1 under strmgent conditions (as 
defined below), or which encodes a LECl polypeptide. LECl polynucleoUdes can also be 
identified by their ability to hybridize under low stringency condifions (e.g., Tm ~40°C) to 
nucleic acd probes having a sequence from position 1 to 81 in SEQ ID NO:l or from 

posifion 355 to 627 in SEQ ID NO: 1 . 

A "promoter from a LECl gene" or "LECl promoter" will typically be about 
500 to about 2000 nucleotides in length, usually from about 750 to 1500. Exemplary 
promoter sequences are shown as nucleotides 1-1998 of SEQ ID NO:3 or as SEQ ID NO:23. 
A LECl promoter can also be idenfified by its ability to direct expression in all, or essentially 
all proglobular embryonic cells, as well as cotyledons and axes of a late embryo. 

>^ "LECl polypeptide" is a sequence of about 50 to about 210, sometimes 100 
to 150 amino acidresidues encoded by a LECl polynucleotide. A fi.ll length LECl 
polypeptide and fragments containing a CCAAT binding factor (CBF) domain can act as a 
subunit of a protein capaWe of acting as a transcripfion factor in plant cells. EEC I 
polypeptides are often distinguished by the presence of a sequence which is required for 
binding the nucleotide sequence: CCAAT. In particular, a short region of seven residues 
(MPIANVI) at residues 34-40 of SEQ ID NO: 3 shows a high degree of similarity to a region 
that has been shown to required for binding the CCAAT box. Similariy, residues 61-72 of 
SEQ ID NO: 3 (IQECVSEYISFV) is nearly identical to a region that contains a subunit 
interaction domain (Xing, et al., (1993) EMBQ J. 12:4647-4655). 

As used herein, a homolog of a particular embryo-specific gene (e.g., SEQ ID 
NO- 1 ) is a second gene in the same plant type or in a different plant type, which has a 
polynucleotide sequence of at least 50 contiguous nucleotides which are substantially 
identical (determined as described below) to a sequence in the first gene. It is believed that, 
in general, homologs share a common evolutionary past. 

"Increased or enhanced LECl activity or expression of the LECl gene" refers 
to an augmented change in LECl activity. Examples of such increased activity or expression 
include the following. LECl activity or expression of the LECl gene is increased above the 
level of that in wild-type, non-transgenic control plants (i.e. the quantity of LECl activity or 



expression of the LECI gene is increased). LECI activity or expression of the LECI gene >s 
,n an organ, tissue or cell where it is not normally detected in wHd-type, non-transgen,c 
control plants (i.e. spatial distribution of LECI activity or expression of the LECI gene is 
increased). LECI activity or expression is increased when LECI activity or expression ot the 
5 LECI gene is present in an organ, tissue or cell for a longer period than in a wild-type, non- 
transgenic controls (i.e. duration of LECI activity or expression of the LECI gene is 
increased). 

A "polynucleotide sequence from" a particular embryo-specific gene is a 
subsequence or full length polynucleotide sequence of an embryo-specific gene which, when 
,0 present in a transgenic plant, has the desired effect, for example, inhibiting expression of the 
endogenous gene driving expression of an heterologous polynucleotide. A full length 
sequence of a particular gene disclosed here may contain about 95%, usually at least about 
98% of an entire sequence shown in the Sequence Listing, below. 

The term "reproductive tissues" as used herein includes fruit, ovules, seeds, 

1 5 pollen, pistols, flowers, or any embryonic tissue. 

In the case of both expression of transgenes and inhibition of endogenous 
genes (e g by antisense, or sense suppression) one of skill will recognize that the inserted 
polynucleotide sequence need not be identical and may be "substantially identical" to a 
sequence of the gene from which it was derived. As explained below, these variants are 

20 specifically covered by this term. 

In the case where the inserted polynucleotide sequence is transcribed and 
translated to produce a functional polypeptide, one of skill will recognize that because of 
codon degeneracy a number of polynucleotide sequences will encode the same polypeptide. 
These variants are specifically covered by the term "polynucleotide sequence from" a 
25 particular embryo-specific gene, such as LECI . In addition, the term specifically includes 
sequences (e.g., full length sequences) substantially identical (determined as described 
below) with a LECI gene sequence and that encode proteins that retain the function of a 

LECI polypeptide. 

In the case of polynucleotides used to inhibit expression of an endogenous 

30 gene, the introduced sequence need not be perfectly identical to a sequence of the target 
endogenous gene. The introduced polynucleotide sequence will typically be at least 
substantially identical (as determined below) to the target endogenous sequence. 

Two nucleic acid sequences or polypeptides are said to be "identical" if the 
sequence of nucleotides or amino acid residues, respectively, in the two sequences is the 
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same when aligned for maximum correspondence as described below. The term 
"complementary to" is used herem to mean that the sequence is complememary to all or a 
portion of a reference polynucleotide sequence. 

Optimal alignment of sequences for comparison may be conducted by the 
local homology algorithm of Smith and Waterman AM API. Matk. 2:482 (1981), by the 
homology altgnment algorithm of Needle man and Wunsch J Mol. Biol. 48:443 (1970), by 
the search for similarity method of Pearson and Lipman Proc. Nail. Acad. Sci. (U.S.A.) 85: 
2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, 
BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics 
Computer Group (GCG), 575 Science Dr., Madison, WI), or by inspection. 

'•Percentage of sequence identity" is determined by comparing two optimally 
aligned sequences over a comparison window, wherein the portion of the polynucleotide 
sequence in the comparison window may comprise additions or deletions (i.e.. gaps, as 
compared to the reference sequence (which does not comprise additions or deletions) for 
optimal alignment of the two sequences. The percentage is calculated by determining the 
number of positions at which the identical nucleic acid base or amino acid residue occurs in 
both sequences to yield the number of matched positions, dividing the number of matched 
positions by the total nui^iber of positions in the window of comparison and multiplying the 
result by 100 to yield the percentage of sequence identity. 

The term "substantial identity" of polynucleotide sequences means that a 
polynucleotide comprises a sequence that has at least 25o/o sequence identity. Alternatively, 
percent identity can be any integer from 25o/o to 100%. More preferred embodiments include 
at least: 25%, 30%, 35%, 40%, 45o/o, 50%, 55%, 60%. 65%, 70o/o, 75%, 80%, 85%, 90%, 
950/0 or 99%. compared to a reference sequence using the programs described herem; 
preferably BLAST using standard parameters, as described below. Accordingly, LECl 
sequences of the invention include nucleic acid sequences that have substantial identity to 
SEQIDNOI SEQlDNO:3,SEQIDNO:4,SEQIDNO:19andSEQIDNO:21. LECl 
sequences of the invention include polypeptide sequences having substantial identify to SEQ 
ID NO-2 SEQ ID NO:20 or SEQ ID NO:22. One of skill will recogmze that these values can 
, be appropriately adjusted to determine corresponding identity of proteins encoded by two 
nucleotide sequences by taking into account codon degeneracy, ammo acid similarity, 
readmg frame positioning and the like. Substantial identity of amino acid sequences for these 
purposes normally means sequence identity of at least 40%. Preferred percent identity of 
polvpeptides can be anv integer from 40o.o to 100%. More preferred embodiments include at 
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least 60% 65% 70%. 75%. 8O0/0, 85%, 90%. 95%, or 99-o. Most preferred embodiments 
.nclude 67%. 68%. 69%, 70%, 71%. 72-o. 73%, 74^0 and 75%. Polypeptides wh.eh are 
"substantially s.mHar" share sequences as noted above except that res.due positions wh.ch are 
not identical n.av dUTer by conservative amino acid changes. Conservative ammo ac.d 
substitutions refer to the interchangeability of residues having similar side chains. For 
example a group of amino acids having aliphatic side chains ,s glycine, alamne, valine, 
leucine and isoleucine; a group of amino acids having ahphatic-hydroxyl side chains is 
serine and threonine; a group of amino acids having amide-containing side chains .s 
asparagine and glutamine; a group of amino acids having aromatic side chains is 
phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is 
lysine argimne, and histidine; and a group of amino acids having sulfur-contam.ng side 
chains IS cysteine and methionine. Preferred conservative amino acids substitution groups 
are; valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alan.ne-valine, 
aspartic acid-glutamic acid, and asparagine -glutamine. 

Another indication that nucleotide sequences are substantially identical is if 
two molecules hybridize to each other, or a third nucleic acid, under stringent conditions. 
Stringent conditions are sequence dependent and will be different in different circumstances. 
Generally, stringent conditions are selected to be about 5°C lower than the thermal melting 
point ,Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the 
temperature (under defined ionic strength and pH, at which 50% of the target sequence 
hvbridizes to a perfectly matched probe. Typically, stringent conditions will be those m 
which the salt concentration is about 0.02 molar at pH 7 and the temperature is at least about 



60°C. 



25 
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In the present invention, mRNA encoded by embryo-specific genes of the 
invention can be identified m Northern blots under stringent conditions using cDNAs of the 
invention or fragments of at least about 1 00 nucleotides. For the purposes of this disclosure, 
stringent conditions for such RN A-DN A hybridizations are those which include at least one 
wash in 0 SSC at 63"C for 20 minutes, or equivalent conditions. Genomic DNA or cDN A 
comprising genes of the invention can be identified using the same cDNAs (or fragments of 
at least about 100 nucleotides) under stringent conditions, which for purposes of this 
disclosure, include at least one wash (usually 2) in 0.2X SSC at a temperature of at least 
about 50"C. usually about 55°C. for 20 minutes, or equivalent conditions. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
I^H^ure 1 A shows a schematic representation of the three domains of the LECl 
polypeptide. Figu,^ 2B shows a comparison of the predicted amino acid sequence of the B 
domain encoded by LEG 1 with HAP3 homologs from maize, chicken, lamprey, Xenopus 
laveis human, mouse, rat, Emencella n.dulans, Schizosaccharomyces pombe, 
Saccharomyces cerevisiae. and Kluyveromyces lactis. The DNA-binding region and the 
subunit interaction region are indicated. Numbers indicate amino acid positions of the B 
domains. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 
■n,e present invention provides new embryo-speeific genes useful in 
genetieally engineering plants. PCynucleotide sequence, front the genes of .he invention ean 
be used for instance, to direct expresston of desired heterologous genes ,n embryos (,n the 
ease of promoter sequences) or to modulate development of embryos or embyon.c 
characteristics on other organs (e.g.. by enhancing expression of the gene in a transgen.c 
plant) m pattieular, the inventton provides a new gene from Arabidopsis referred to here as 
LECl LECl encodes polypeptides which subuni.s of a protein which acts as a transcript.on 
factor Thus, modulation of the expression of this gene can he used to manipulate a number 
of useful tratts, such as increasing or decreasing storage protein content in cotyledons or 

Generally, the nomenclature and the laboratory procedures in recombinant 
DNA technology described below are those well known and commonly employed in the art. 
Standard techniques are used for cloning, DNA and RNA isolation, an.plification and 
purification. Generally enzymatic reactions involving DNA ligase, DNA polymerase, 
restriction endonucleases and the like are performed according to the manufacturer's 
specifications. These techniques and various other techmques are generally performed 
according to Sambrook et al. Molecular Clomng - A Laboratory Manual, 2nd. ed.. Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York, (1989). 
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Unlation of nncieic acids of the invention 

The isolation of sequences from the genes of the invention may be 
accomplished by a number of techniques. For instance, oligonucleotide probes based on the 
sequences disclosed here can be used to identify the desired gene in a cDNA or genomic 
DNA library from a desired plant species. To construct genomic libraries, large segments of 
genomic DNA are generated by random fragmentation, e.g. using restriction endonucleases, 
and are ligated with vector DNA to form concatemers that can be packaged into the 
appropriate vector. To prepare a library of embryo-specific cDNAs, mRNA is isolated from 
embryos and a cDNA library that contains the gene transcripts is prepared from the mRNA. 

The cDNA or genomic library can then be screened using a probe based upon 
the sequence of a cloned embryo-specific gene such as the polynucleotides disclosed here. 
Probes may be used to hybridize with genomic DNA or cDNA sequences to isolate 
homologous genes in the same or different plant species. 

Alternatively, the nucleic acids of interest can be amplified from nucleic acid 
samples using amplification techniques. For instance, polymerase chain reaction (PGR) 
technology to amplify the sequences of the genes directly from mRNA, from cDNA, from 
genomic libraries or cDNA libraries. PGR and other in vitro amplification methods may also 
be useful, for example, to clone nucleic acid sequences that code for proteins to be expressed, 
to make nucleic acids to use as probes for detecting the presence of the desired mRNA in 
samples, for nucleic acid sequencing, or for other purposes. 

"Appropriate primers and probes for idenfifying embryo-specific genes from 
plant tissues are generated from comparisons of the sequences provided herein. For a general 
overview of PGR se6 PGR Protocols: A Guide to Methods and Applications. (Innis, M, 
Gelfand, D., Sninsky, J. and White, T., eds.). Academic Press, San Diego (1990). 
Appropriate primers for this purpose include, for instance: UP primer - 5' GGA ATT GAG 
GAA GAA GGG AAG GGC A 3" and LP primer - 5' LP primer - 5' GCT GTA GAG ATA 
GAA GAG TTT TCC TTA 3'. Alternatively, the following primer pairs can be used: 5' 
ATG AGG AGG TGA GTC ATA GTA GC 3' and 5' GGG AGA GAT GGT GGT TGG TGC 
TG 3' or 5' GAG ATA GAG AGG GAT GGT GGT TG 3' and 5' TGA CTT ATA CTG AGG 
, ATA ATG GTG 3'. A third set of primers include: 5'-AGG ATG CAT GGA AGG TGG 

AGG GTT GGA T-3' and 5'-ATG TAG ATG AGT AGT TAT GTT GTT GAG TCG-3^ The 
amplificafions conditions are typically as foUows. Reaction components: 10 mM Tns-HGl. 
pH 8 3 50 mM potassium chloride, 1.5 mM magnesium chloride, 0.001% gelatin, 200 
microM (uM) dATP. 200 microM dGTP. 200 microM dGTP, 200 microM dTTP. 0.4 microM 
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pnmers, and lOtXunits per n.l Taq polymerase. Program: 96 C for 3 min., 30 cycles of 96 C 
^ % lor 45 sec 50 C forM sec, 72 for 60 sec. followed by 72 C for 5 mm. 

Polynucleotides may also be synthesized by well-known techniques as 
described m the technical literature. See, e.g., Carruthers e, c.l. Cold Spring Harbor Symp. 
. Quant Biol 47:41 1-418 (1982), and Adams a/., ./. ^m. Chem. Soc. 105:661 (1983). 
Double stranded DNA fragments may then be obtained either by synthesizing the 
complementary strand and annealing the strands together under appropriate conditions, or by 
adding the complementary strand usmg DNA polymerase with an appropriate pnmer 
sequence. 

10 Analysis of LF t-l Txpne Sequences 

The genus of LECl nucleic acid sequences of the invemion includes genes and 
gene products identified and characterized by analysis using the sequences nucleic acid 
sequences, including SEQ ID N0:1. SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:19 and SEQ 
ID NO-21 and protein sequences, including SEQ ID N0:2, SEQ ID NO:20 and SEQ ID 

15 NO-22 LECl sequences of the invention include nucleic acid sequences having substantial 

identity to SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:19 and SEQ ID NO:21 . 
LECl sequences of the invention include polypeptide sequences having substantial identify 
to SEQ ID NO:2, SEQ ID NO:20 and SEQ ID NO:22. 

LECl nucleic acid sequences also include fusions between two or more LECl 
20 genes Different domains of different genes can be fused. LECl gene fusions can be linked 
directly or can be attached by additional amino acids that link the two of more fusion 

partners. ^ . 

Gene fusions can be generated by basic recombinant DNA techniques as 

described below. Select.on of gene fusions will depend on the desired phenotype caused by 

25 the gene fusion. For mstance, ,f phenotypes associated with the A domain of one LECl 

protein are desired with phenotypes associated with the B domain of a second LECl protem. 

the a fusion of the first LEC, protetn's A domain to the second LECPs B domain would be 

created. The fusion can subsequently be tested ,« W,™ or ,„ for the desired phenotypes. 

30 . ..f n,..leic aci d^ "f ■"vention to inhibit pcne expression 

The isolated sequences prepared as described herein, can be used .0 prepare 
expression cassettes useful m a number of techniques. For example, expression cassettes of 
the inventton can be used to suppress endogenous LECl gene cxpresston. lnh,b,t,ng 
expression can be useful, for instance. ,n weed control (by transferring an inhibitory sequence 
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,„ a weedy speeies and allowing it .0 be transmitted tltrougl, sexual erosses, or to produce 

fruit with small and non-viable seed. 

A number of methods ean be used to inhibit gene expression m plants. Ktr 
instance, antisense technology ean be conveniently used. To accompl.sh this, a nucleic acd 
5 segment from the desired gene ts cloned and operably linked to a promoter such that the 

antisense strand of RNA will be transcribed. The expression cassette ts then transformed tn.o 
plants and the antisense strand of RNA ,s produced. In plant cells, it has ^J^^^" 
antisense RNA inhibits gene expresston by preventing the accumulation of mRNA wh.ch 
encodes the enzyme of interest, see, e.g., Sheehy e, Proc Na,. Acai. SO- USA. 
0 85 88C5-8809(l988>,andHiatt«r,/.,U.S.PatentNo.4.80l,340. 

The antisense nucleic acid sequence transformed into plants w,ll be 
substantially identical to at least a portion of the endogenous embryo-specflc gene or genes 
.0 be repressed. The sequence, however, does not have to be perfectly identical to tnhtb.t 
expresston. The vectors of the present invention can be destgned such that the .nhtbttory 
, 5 effect applies to other proteins wtthin a fam.ly of genes exhibiting homology or substannal 

homology to the target gene. , n u^ ,u 

For antisense suppression, the introduced sequence also need not U full length 

relative to either the pnma^ transcriptton product or fully processed mRNA. Generally, 
htgher homology can be used to compensate for the use of a shorter sequence. Furthermore 
,0 th introduced sequence need no, have the same intron or exon pattern, and "omol gy of non- 
coding segments may be equally effective. Normally, a sequence of between abot^, or 0 
nucleotides and about full length nucleondes should be ttsed, though a sequence of a, least 
a,„„t 1 00 nucleotides is preferred, a sequence of at least about 200 nucleotides ,s more 
preferred and a sequence of a. least about 500 nucleotides is especially preferred. 
„ ' catalytic RNA molecules or ribozymes can also be used to inhrbit expresston 

' of embryo-specfic genes. It ts possible to design ribozymes that speetfically pair wt.h 
vtrtually any targe. RNA and cleave the phosphodiester backbone a. a spec.tc location, 

reby fun tionally mactivating the target RNA. In canning out this cleavage the „ ozyme 
I not Leif altered, and is thus capable of recycling and cleaving other molecules, makmg it a 
30 true enzyme. The inclusion of ribozyme sequences within antisense RNAs confers 
RNA-cleaving activity upon them, thereby increasing the activity of the constructs. 

A number of classes of ribozymes have been idemified. One class of 
ribozymes is derived from anumber of small circular RNAs that are capable of self-cleavage 
and replication in plants. The RNAs replicate either alone (viroid RNAs) or with a helper 
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V.US ,.a,elH.e RNAs,^ Examples .nclude RNAs from avo.adc sunblo.ch v.roid and .he 
satellite RNAs fron, .obaeco rin.spo. virus, lucerne rransien, srrcat virus, velve, .obace., 
,n„„le virus, solanum nodiflorum nroUle v.rus and subterranean elover mCle v.rus. he 
des„„ and use of .arge. RNA-speeif,c r,bo.yn,es ,s described in Haseloff e, al. N..n. 

334:585-591 (IW 

Anolher method of suppression is sense suppression. Introduet.on of 

expression cassettes ,„ which a nucleic acid is configured ,n the sense oHenta.ton »,th respect 

1 promoter has been shown to be an effeCtve means by wHtch to bloc, the transcnpt.on 

of target genes. For an example of the use of this method to modulate expresston of 

llL genes see, Napo.i et al., C,, .27,-28, (19,0), and U.S. Patents Nos. 

S 034 323 5,231,020, and 5,283,184. 

Generally where inhibitton of expression is desired, some transcrtpt.on of the 

mtrodueed sequence occurs. The effect may occur where the imroduced sequence conta.ns 
no coding sequence per se, but only intron or untranslated sequences — ^ ° 
sequences present in the primary transcript of the endogenous sequence. The ,n.oduce 
.elenee generally will be substant.ally identical to the endogenous sequence tntended to 

.Ltity might exenamoreeffecttverepresstonofexpressionoftheendogenous.^^^^^^ 
substantially greater identity of more than about 80% is pre.erred, though abo 95 / to 
bsolute tdentUy would be most preferred. As with anttsense regulation, the effect should 
apply to any other proteins within a similar family of genes exhibiting homology or 

substantial homology. . 

For sense suppression, the introduced sequence m the expresston cassette, 

„eed,„g less than absolute tdemity, also need not be full length, relattve to either the prtmary 
. IscrUn Proauct or fully processed mRNA. This may be preferred to avo.d eoncurren 
p oduin of some plants which are overexpressers. A higher identt.y in a shorter than ful, 
ength sequence compensates for a longer, less identtcal sequence. Furthermore, t e 
introduced sequence need no. have the same intron or exon parte., and ,dent,ty of no,^^ 
coding segments will be equally effecttve. Normally, a sequence of the s,.e ranges noted 
n above for antisense regulation is used. 

one of skill in the art w,ll recognize that us.ng technology based on specfic 
nucleottde sequences (c. , antisense or sense suppression technology), families of 
homologous genes can be suppressed with a single sense or anttsense transcnpt. 
Ince ,r a sense or anttsense transcnpt is designed to have a sequence that ,s conse.ed 
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a,.cng a family of g.n^s U;.. ..-= B d„n,a,„ of LECl ), then „,uUiple n,e„,bers o, a ,e„= 
family can be suppressed. Conversely, if .he goal is .o only suppress one member ol a 
homologous gene famrly. .ben ,be sense or an.rsense .ransenp. should be .arge.ed ,0 
.e,uenees wi.h .he mos. vairanee be.ween family members Por msranee, an an.r^nse 
,„„,„ip, idenrieal .o .he A and C domains of LECl ean be used ,o suppress LECl «,.bou, 
suppressing rela.ed genes sueh as deserrbed in SEQ ID N0:19 or SEQ ,DN0:2L 

Another means of inhibiting LECl funetion in a plam is by ereat.on of 
dommant negative mu.at.ons. In th.s approach, non^netronal, mutant EEC, polypept.des, 
„bieb retain .be abi.,.y to rnteraet «,th »ild-type subunrts are introdueed tnto a plant. 
Me«if,ea.ion of residues ,bat ean be changed to create a dominam negative ean be 
L™,ned by published .or. exammmg interaction of drfferen, subunits o BE Komologs 
from different spee.es (see, e.g.. Smha e, a,.. (1995). rroc. NaU. Aca,^ Sc. USA 



92:1624-1628.) 
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UiaofjmiJeicadfc^ 
^^^j:;:^;;;;:;:::;;;;:^^ herem can also be used to prepare 

expression casse,.es which enhance or increase endogenous LECl gene express.on^ Where 
ore pression of a gene is desired, .he des.red gene from a drfferen. species may be used to 
se potential sense suppressron effects. Enhanced expression of EEC, po ynueleo.rdes 
Tuseful, for example, to mcrease storage prote.n content in plant ttssues. Such techm<,ues 
mav be pa«ieularly useful for improving the nutritional value of plartt ttssues. 

Any of a number of means well known in the art ean be used to mcrease LECl 
aetivity in plants. Er^aneed expression ,s useful, for example, to induce embyome 
le erils in plants or plant organs. Any organ ean be targeted, such as shoot vege.a.,ve 
o :Le.„resL. leaves, stems and tubers), roots, flowers and Horal otgan.s.ruc.ures 
; , bracts, sepals, petals, stamens, carpels, anthers and ovules), seed (includ.ng embryo, 
endosperm, and seed coat, and frurt. Altennatrvely, one or several EEC, genes can be 
expressed cons.itu.ively (e.g., using the CaMV 35S promoter). 

brte of skill w,ll recognize tba. the polypeptides encoded by the genes of the 
invention, like o.he^pro.eins, have drfferen. domains which perform different funeuons. 
Is. .hegenese,uenees needno.be full length,solo„gasthedes,redfunet,ona^ 

.he prorein ,s expressed. As explained above, LECl polypep.ides are related .o CCAAT 
:xUgfaeL.CBE,pr„,ei..s.CBEsareh,ghlyeonser..edfamilyof«^^^ 

.hat regulate gene aetivr.y m eukaryotie organ.sms (see. e.g.. Mantvan, ,1992) Aad. 
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20^ 1 087. K»l -, Li ( 1 9.2) AWc. AaJs Res. 20. 1 087- 1 091 ). LEC 1 wa. found ,o have 
hi,K s,™.3ri.y .o a por.,o„ of the HAP, suhuni, of CBF, Thus, wi.hou, being bound io an, 
panieula, theory or meehanisn,. LEC, ,s Ukely .0 act as a transcrip.iona, modulator. H AP3 
is dtvtded into three domaitts, an atnino terminal A domain, a centra. B domain, and a 
earboxyl terminal C domatn, as shown d,agramma.,ea.,y in Ftgure 2A. ^P^" ' 
has between abou, 75% and 85% sequence similarity, whtch is equivalent to 55 to 63 /. 
sequence idemity, .«.h the B domatns of the other HAP3 homologs shown in Figure 2B, see 
also. Example 1, below. Figure 2B shows the amtno acid sequence homology between LECl 

and other CBF homologs. . 

The LECl polypeptide also has an amino terminal A domam. a central B 
domain, and a carboxyl terminal C domain. The three domains of the LEC. polypeptide are 
defined as follows: ,n SEQ ID NOa the A domain is located between about amtno ac, 

.0 about position 1 ,7; and, the C domain is located between about posttion . . 8 to about 
position 208. The B domain of LEC, L.L and P„aseo,us LIL are all closely related, 
whereas the A and C domains display almost no homology to each other. 

V nucleotide sequence for LECl corresponding to each domam ts dtsp.ayed 
in SEQ ID NO li .g.. the A domain is .ocatcd between about nuc.eot.de position . to about 
:u:i!o.,"posit,onU.heBdomai„,slocatedbe.weenabou.nncleo.tdeposi.ion83t^^^ 

nucleotide position 35t,the C domain is located between about nucleotide postUon 352 to 
about nucleotide position ^24. 

One of skill in the an will recognize that the domain boundaries are 
approximate. The boundaries for the domatns of the LEC , polypeptides and nttCeotides cat, 
1 from 1 to 20 amino acids residues (1-60 nucleottdes) from the boundancs hstcd above. 

The DN A binding activity, and. therefore, transcription activation fnnc.on, of 
LEC. polypeptidesrs though, to be modulated by a short region of seven residues, MPIANVI 
(found at residues 34-40 of SEQ ID NO: 2). Thus, the polypepttdes of the .nventton 

will often retain these sequences. 

^jr^i^fiti^" of pndotienou s 1 ,F,C1 genes 

Methods for tntrodncing genetic mutattons into p.am genes and selecting 
plants with desired traits are well known. For mstance, seeds or other plant ma.ertal can be 
treated with a mutagenic chemtcal substance, according to standard techntques. Such 
chemical substances include, but arc not hmited to, the followtng: dteth.l sulfate, ethylene 
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,n,i„=. ethyl ,.c,ha„=sulfona.c and N-n,.roso-N.e.l,ylurea. Alternatively, iontz.ne rad.ation 
from sources such as. X-rays or samma rays can be used. 

Modif.ed protein chains can also be readily designed utiliz.ng various 
recombtnan, DNA techn.ques well knov.., ,„ those skilled in the art and described for 
5 instance, in Sambrook et al.. supra. Hydroxylamine can also be used to introduce single base 
n^utations into the cod.ng regton of the gene (Srkorski, e, »/.. 1 199„. Me.k E..y.oL 194: 
30.-3 1 8, For example, the chains can vary from the naturally occurrtng sequence a. the 
primal structure level by amino acid substitutions, additions, deletions, and the h^e^ These 
Idifications can be used in a number of combinations to produce the final mod.fted protem 

Alternatively, homologous recombination can be used to induce targeted gene 
modifications by specifically targeting the LECl gene i. Wv„ (.v.-c. generally. Grewal and 
Klar 0>„.,c., 14. 1221..23S (19,7, and Xu . r... 0.„« .ev. 10: 24, 1-2422 (1996,,. 
Homologous recombmafion has been demonstrated in plants (Puchta « .... E.penen.a : 
,5 ,77.284 (1994), Swobodae,<t/,,£MiiO^. ,3: 484-489 (1994): Offringa />r.c. Na„. 
SO USA 90: 7346-7350 (1993 and Kempin c, Na.ure 389:802-803 (1997)), 
m applying homologous recombination technology to the genes of the 
invention, mutations m selected portions of an LEC, gene sequences (including 5' upstream, 
3- downstream, and intragenic regions, such as those dtsclosed here are made in.Uro and 
20 then introduced into the desired plam using standard techniques. Stnce the efficiency of 
homologous recombination isknow^to be dependenton the vectors use,useo^d_ 

gene targeting vectors as described by Mountford c, Pr... Na„. Maa^ Sc.. USA U 4 0 - 
4307 „994)- and Vaulont e, aL Transgenic Res. 4: 247-255 (1995, are eonven.ently used to 
inctease the efficiency of selecting for altered LEC, gene expression in transgenic plants. 
„ The mutated gene will interact with the target wild-type gene in such a way that homologous 
recombination and targeted replacemem of the wild-type gene will occur m transgenic plant 
cells, resulting in suppression of LECl activity. ,„fO»„„d 
Alternatively, oligonucleotides composed of a conUguous stretch of RN A and 
DNA residues in a duplex conformation with double hairptn caps on the ends can be used. 
30 The RNA/DNA sequence is designed to align with the sequence of the target IK. gene and 
to contain the desired nucleotide change. Introduction of the chimeric oligonucleotide on au 
extrachromosomal T-DNA plasmid results m efficient and specific LECl gene convers.n 
,.ected by chimeric molecules in a small number of transfonrted plant cells. This method ,s 
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described m Col.-Strauss c, aL Science 273: . 386- ,389 ( 1 9%) and Yoon c, oi ,'r.,. Nc„i 
4c-i,J Sci USM93;2071-2076(1996). 

Desired modif.cd LEC 1 polypeptides can be idenlitled using assays to screen 
for the presence or absence of wild type LEC I activity^ Such assays can be based on the 
abilitv of tite LECl protein to functionally complemen, the hap3 mutation tn yeas,. As noted 
above it has been sho«n that hontologs from different species functionally interact wtth 
yeast subunits of the CBF. (Sinha, e, <„„ (1995,. Pr... .a,,. Sc, USA 92:1624-1628,; 

see also. BecKer, e, al.. (1991,. Pro. ml AcM. Sci USA 88: .968-1972,. The reponer for 
thi^ screen can be any of a number of standard reporter genes such as the laeZ gene eneodtng 
beta-galacosidase that ,s fused with the regulatory DN A sequences and promoter of the yeas, 
CYCl gene. This promoter is regulated by the yeast CBF. 

A plasmid containing the LEC 1 cDN A clone is mutagenized in vitro 
according to technicues »ell known in the art. The cDNA tnserts are excised from the 
plasmid and insetted into the cloning site of a yeast expression vector such as pYES2 
(invitrogen,. The plasmid is introduced ,n,o hap3- yeas, containing a lacZ reporter ,ha, ,s 
regulated by .he yeas. CBF such as pLG265UPl-lacZ (Guaren.e, c, ./,, (1984, Cell 36: 
317-321 1 Transformanrs arc .hen selected and a HUer assay is used ,o ,es, coloraes for 
be.a-galactosidase activity. After confirming the results of activity assays, immunochemtcal 
tests using a LEC , anubody are performed on yeast lines ,hat lack bcta-galactosidase act,v„y 
to identify those .ha. produce srable LECl pro.ein bu, lack acivity. The mutant LEC. genes 
are then cloned from ,he yeast and thetr nueleolide sequence determined .o iden.ify .he na.ure 
of the lesions. 

,n other embodiments, the promoters derived from the LECl genes of the 
invention can be used to drive express.on of heterologous genes in an embryo-spee.fic or 
seed-specific manner, such that desned gene products are present in the embryo, seed, or 
fruit suitable structural genes that could be used for this purpose include genes encoding 
proteins useful in rncreasing the nutrittona, value of seed or fruit. Examples .nclude genes 
eneodtng enzymes involved in the biosynthesis of antioxidants such as vttamin A vttamtn C 
vitamm E and melatonin. Other su.table genes encoding protems involved in modtficatton o, 
fatt, actds, or in the biosynthesis ofltpids, proteins, and carbohydrates. Still other genes can 
be those encoding proteins tnvolved in auxin and auxin analog b.osynthests for increastng 
fruit size, genes encoding pharmaceufcally useful compounds, and genes eneodtng plant 
resistance products to combat fungal or other infections of the seed. 
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Typically, desired promoters are identified by analyzing the 5' sequences of a 
genomic clone corresponding to the embryo-specific genes described here. Sequences 
characteristic of promoter sequences can be used to identify the promoter. Sequences 
controlling eukaryotic gene expression have been extensively studied. For instance, promoter 
5 sequence elements include the TATA box consensus sequence ( TATAAT), which is usually 
20 to 30 base pairs upstream of the transcription start site. In most instances the TATA box 
is required for accurate transcription initiation. In plants, further upstream from the TATA 
box at positions -80 to -100, there is typically a promoter element with a series of adenines 
surrounding the trinucleotide G (or T) N G. J. Messing et al., in Genetic Engineering in 
10 Plants, pp. 221-227 (Kosage, Meredith and HoUaender, eds. (1983)). 

A number of methods are known to those of skill in the art for identifying and 
characterizing promoter regions in plant genomic DNA (see, e.g., Jordano. et al. Plant Cell 
1- 8S5-866(1989); Bn.tos,et aU Plant Cell 1:839-854(1989); Green, a/., 7, 
4035-4044 (1988); Meier, et al. Plant Cell 3, 309-316 (1991 ); and Zhang, et aL Plant 
15 Physiology 1 10: 1069-1079 (1996)). 



Preparation of recnmhinant vectors 

To use isolated sequences in the above techniques, recombinant DNA vectors 
suitable for transformation of plant cells are prepared. Techniques for transforming a wide 
20 variety of higher plant species are well known and described in the technical and scientific 
literature. See. for example, Weising et al Ann. Rev. Genet. 22:421-477 (1988). A DNA 
sequence coding for the desired polypeptide, for example a cDNA sequence encoding a full 
length protein, will preferably be combined with transcriptional and translational initiation 
regulatory sequences which will direct the transcription of the sequence from the gene in the 

05 intended tissues of the transformed plant. 

For example, for overexpression, a plant promoter fragment may be employed 
which will direct expression of the gene in all tissues of a regenerated plant. Such promoters 
are referred to herein as "constitutive" promoters and are active under most environmental 
conditions and states of development or cell differemiation. Examples of constitutive 

30 promoters include the cauliflower mosaic virus (CaMV) 35S transcription initiation region, 
the r- or 2'- promoter derived from T-DNA of Agrobacterium tumafaciens, and other 
transcription initiation regions from various plant genes known to those of skill. 

Alternatively, the plant promoter may direct expression of the polynucleotide 
of the invention in a specific tissue (tissue-specific promoters) or may be otherwise under 
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more precise environmental control (inducible promoters). Examples of tissue-specific 
promoters under developmental control include promoters that initiate transcription only in 
certain tissues, such as fruit, seeds, or flowers. As noted above, the promoters from the EECl 
genes described here are particularly useful for directing gene expression so that a desired 
5 gene product is located in embryos or seeds. Other suitable promoters include those from 
genes encoding storage proteins or the lipid body membrane protein, oleosin. Examples of 
environmental conditions that may affect transcription by inducible promoters include 
anaerobic conditions, elevated temperature, or the presence of light. 

If proper polypeptide expression is desired, a polyadenylation region at the 3'- 
10 end of the coding region should be included. The polyadenylation region can be derived 
from the natural gene, from a variety of other plant genes, or from T-DNA. 

The vector comprising the sequences (e.g., promoters or coding regions) from 
genes of the invention will typically comprise a marker gene which confers a selectable 
phenotype on plant cells. For example, the marker may encode biocide resistance, 
1 5 particularly antibiotic resistance, such as resistance to kanamycin, G41 8, bleomycin, 
hygromycin, or herbicide resistance, such as resistance to chlorosluforon or Basta. 

LECl nucleic acid sequences of the invention are expressed recombinantly in 
plant cells to enhance and increase levels of endogenous LECl polypeptides. Alternatively, 
antisense or other LECl constructs (described above) are used to suppress LECl levels of 
20 expression. A variety of different expression constructs, such as expression cassettes and 
vectors suitable for transformation of plant cells can be prepared. Techniques for 
transforming a wide variety of higher plant species are well known and described in the 
technical and scientific literature. See, e.g., Weising et al. Ann. Rev. Genet. IIAIX-All 
(1988). A DNA sequence coding for a LECl polypeptide, e.g., a cDNA sequence encoding 
25 a full length protein, can be combined with cis-acting (promoter) and trans-acting (enhancer) 
transcriptional regulatory sequences to direct the timing, tissue type and levels of 
transcription in the intended tissues of the transformed plant. Translational control elements 
can also be used. 

The invention provides a LECl nucleic acid operably linked to a promoter 
30 which, in a preferred embodiment, is capable of driving the transcription of the LEC 1 coding 
sequence in plants. The promoter can be, e.g.. derived from plant or viral sources. The 
promoter can be, e.g., constitutively active, inducible, or tissue specific. In construction of 
recombinant expression cassettes, vectors, transgenics, of the invention, a different promoters 
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can be chosen and employed to differentially direct gene expression, e.g., in some or all 

tissues of a plant or animal. 

Vl vpically. desired promoters are identified by analyzing the 5' sequenees ol a 
genomic elone a,rrespond,ng to the embryo-specille genes deseribed here. Sequences 
charae.eris.ic of promCer sequences can be used to identify the promo.er. Sequences 
controUing eukaryCc gene expression have been extensively studted. For instance, promoter 
sequence elements tnclude the TATA box consensus sequence (TATAAT), which ,s usually 
20 to 30 base pairs upstream of the transcriptton Stan stte. In most insutnces .he TATA box 
is required for accura,e transcription mi.ia.ton. In plants, further upstream from the TATA 
box at posttions -80 to -1 00, .herels typically a promoter element with a series of adentnes 
surrounding ,he trinucleotide-0 (or T) N O. J. Messing e. al.. in Genetic Engineenng ,n 
Plants pp 221-227 (Kosage, Meredith and Hollaender.eds. (1983 )), A number of methods 
are known to those of skill m the art for identtfying and characterizing promoter regions ,n 
plant genomic DNA, see, e,g.,Jordano.««/„ 1:855-866(1989); Bustos, , 

P,an,Cel, 1:839-854(1989); Green, V- ^"""^"^^ U^^.e.al. 
Plam Cell 3, 309-316 (1991); and ZhangN(1996) Mam Physiology' 1 10:1069-1079), 

Constitutive Promoters 

A promoter fragment can be employed which will direct expression of LECl 
nucleic acid in all transformed cells or tissues. e,g, as those of a regenerated plant. Such 
promoters are referred to herein as "const.tuttve" promoters and are active under most 
environmental conditions and states of development or cell differenriatton. Promoters that 
drive expresston conttnuously under physiological conditions are referred to as "constttuttve 
promoters and are active under most environmental conditions and states of development or 
cell differentiation. Examples of constitutive promoters include those from vtruses whtch 
infect plants, such as .he cauMower mosatc virus (CaM V) 35S transcription init.ation regton 
,see eg Dagless, 1997) ^W. ,42:183-191); .he V- or 2'- promoter derived from T- 
DNA of Agrobac.er,um.umafaciens (see, e,g„ Mengis.e (1997, supra; O^Grady (1995) 

Biol 29-99-108); the promoter ofthc tobacco mosaic virus: the promoter ot Ftgwor. 
mosatc virus (see, o,g„ Matti (1997) no„s,..,ic 6:143-156,; aettn P— , such as the 
, Arab,dcpsisactin gene promoter, see, e,g„ Huang, 1997) «r„„A.o. Bio,. " -''^'^ 
alcohol dehydrogenase (Adh) gene promoters (see, e,g„ Millar (1996) Plan, Mol. B.oL 
3, 897-904^.(^r/;from,.™*..^^^^^^^^ 

(■„,jfrom.™Wo..vMGenBankNo,U43147,Zhong<,,»/,M,/.Oe„,C.,,e,, 251:196-203 
,,996,) the gene encoding s,earoyl-acyl carrier protein desa.urase from Bro.sico nam. 
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H K NO X74782 Solocon.be c. Pla. P^siol. 104:1167-1 176 (1994)), GPcl from 
(Genbank No. X7478i. . . , ... g,,,, 908-551-565 (1989)), C;pc2 from 

■ ^rvnR-ink No XI 5596. Martinez ct al. ,/. Mo/. Biol .us.^:^ 
maize (OenBanK iNo. , o- ; t-i-q? n ? (1997)). other 

• ,r R.nkNo U45855.Manjunathe/a/.,P/a"'A/^'/-fi"^'-33.^7-112(W^/)). 

... ^^^^^^^^^^^^^^ ^ J_^^, p,,, ,,.es known to those of skill. See also 

::rm9:rc^^^^ 
^"""~:;apia^^ 

acid of the invention under the influence of changing environmental conditions or 
.V Fxamoles of environmental conditions that may effect 

n 997) Plant Mol. Biol. 33:897-909). 

* AUernafvely, plan, pro.o.ers which are ,nducih,e upon exposure ,o plan, 

to exoress the nucleic acids of the invention. For 

example, the invenuon «-a . , , i ^.107 An7V the auxin- 

T V rr in n 997) ^^v.v'o/. 1 1 5 .397-4U / ), tnc au;^ 

, ,he so,hea„ (O ye.ne '^^^ 

responsive ^"^'^"'^ ° ' 7 ™ „ ,,,, „.„n.indueibie pa,C promo,er frona 
peroxi<ie)(Chen(1996)Pto>.(.^ 10.955 '^OO), , „, ,S,rei, ( 1997) Mo/ 

,o , ■nQQfi,n-906-913);aplantbiotinresponseelemen,(S,rti,(199;)M 

3 -^<^^=-':::ro.!:::^^ 

' „h,cheanheapplie:,o,heplan.,suehasher«.eso,a„,,hio,.a^^^^ 
.enuc,eieae,.or,heinve„,,o„.Porexan,ple.,he.a^^^^^^^^^^ 

n,eS.SV, apphea,.on of d f^ren, he ^^^^^ ^^^^^^ 

30 pa„erns, including expression rn ,he roo,, h „,,„a„cible pr„mo,er. e.g., 

codrng sequence can also he under ,he con.rol of. e.f. « ^^.^^ 
„.,escr.hedwi,h,ransge„lc.o^co.a.^^^ 

decarboxylase gene (Masgrau (199/) Plan, J. 1 1 .46. - 
element(Stange(1997)«./.ll:1315-1324. 
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. , ,„ a specific tissue (ussue-specific promo.crs). Tissue spccfc promo.ers are 

ra:;::::— 

1 /v.r nrimarilv only) in certain tissues, such as 
vegetative tissues, e.g., roots or lea , P .^^^^.,p,,ific promoters 

from maize, Slieridan (1996) Oenei.cs 142J009 1020, ^rabidopsis. 
,05934 Abler (1 993) Pto'W.'t BW. 22-.1013,-1038; v.vparous-l from Arab p 

r I N U93215- atmye, from Arabidopsis, Urao (,996) Plan, Moi B.o,. 32:571-57, 
GenbankNo.U93215,atmyc „ R,„„iea napus GerrBank No. J02798, 

(1995 ) 197:264-271. u h in Reiser (1995) Ce// 83:735-742, 

The ovule-specific BELl gene descrtbedmRetser (1995 ) 

. 1,39944 can also be used. See also Ray (1994) P.oc.«^c../. 5c. f/S^ 

GeiaBaiJ. No. U39944, can ^ ^^^^^^ reproductive 

91:5761-5765. The egg and central cell specific FIEl promoter 

tissue-specific promoter. 1 pvnress LEG 1 nucleic acids 

Sepal and petal specific promoters are also used to express LEG 

r nnner For example, the Arabidopsis floral homeot.c gene 

primordia, and lacer becomes loeaUzed '^P-'^-^ ^^^p;;,;,, for AP2, 

a noral homeouc ,ene ,ha. .s ' .^.„ ,.,,,.,002; Bowman (1991) 
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organs .u.o, .e„e of A.abidopsis. whose expression ,s 
ween sepal and petal pnn^ordia (Bossinge, (IW) Dn^P^en, ,22:1093-1 <)2). 

A maize pollen-specific promoter has been identified in ma.ze (Guerrero 
0,,0, OV„ G.n.,- 224:161-168). Other .encs speetfcally expressed in pollen arc 
escr hed e . , hy Wa.eley ,199S, P,.. B,„. 37:187-192: F.c.er C1998, A«. 0». 
257:132-142: KuliKanskas <,997, B.o, 34:S09-814; Treac, (1997, 

Mo/, Bii';, 34:603-611. 

Other suitable promoters tnclude those from genes eneodmg embryome 
storage proteins. For example, the gene encoding the 2S storage protein from Brasstea napus. 
, Dasgupta (1993, Gene 133:301-302; the 2s seed storage protein /-"^ J"" 

Arabidopsis; the gene encoding oleosin 20kD fronn Brassica ^^^^ ^^^''^^'^^^^^^ 
„e genes encodtng o.eosin A, Gcnban. U091 1 S, and, oleostn B^Genban N »9m 
from soybean; .he gene encoding olcostn from Arabtdops.s. Genban. No. Zi7657, th .ene 
encoding oleosin 1 SkD from maize, GenBanR No. «52, 2, Lee „ 994, P,on, Mol .oL 
, ,,.„SM987; and. the gene encoding iow molecular weigh, sulphur rich pro.e.n from 
^ sovbean. Choi (1995, Mo, 0.„. 0>„«. 246:266-268, can be used, ^^-'-""^^fjj^^ 
promoter from tomato is parttcularly useful for d.recting gene expression so that a des.red 
pene product is located in fruits. _ 

A tontato promoter active during fruit ripentng, senescence and absctssron o, 
20 leaves and, .0 a lesser extent, of flowers can be used (Blume (,997, P.an, 1 ,2 73,-746). 
Lrexemplarypromoterslncludethepistolspectficpromotermthepo.^^^^^^^^ 

tuberosum L., SK2 gene, encoding a pistil-specifie basic endochtfnase (F.cUer (1997, 
MO, B,o, 35-425-43 „; the B,ec4 gene from pea (Ptsum sativum cv. Alaska,, acve tn 
Heraltissueofvegetativeandfioralshootapicesoftransgenic alfalfa. — 
25 useful too, to target the expression of foretgn genes to the ep.derma, ,ayer of acttvely 

" *°"a variety of promoters specifically active ,n vegetative tissues, such as leaves, 
stems roots and tubers, can also be used to express the LEC, nucleic acds of the inventton. 
rimple, promoters controlling patatin. the ma,or storage protein of the P"-;*- 
30 b I see e.g.,Klm<1994,«.„,A«. ^ 26:603-615; Martin (1997, P/r,. ./. 1 1 :53-62. 
;h :RP,3 promoter from Agrobactenum rh.zogenes which exh.bits high ae.tv.ty ,n roots 
1 also be used (Hansen (1997, Mo, 0.. 0.«,. 254:337-343. Other useful v.geta ve 
can also DC use ^ , ^ • of the tiene encodina a globulin from a 

tissue-specfic promoters tnclude: the tann promoter of the fccne 

..or taro (Colocasia esculenta L. Schott) corm protem famUy. tarm (Be.erra (1995) P,a. 
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28. 37-144,; .he curcul.n promoter aCrve during .arc eon. developnren. ,de 
Cas.ro ( . 9«r, rla„, Cell 4: , 549- 1559, a,.d ,he pron.„.cr for .he .obacco roo.-spec.nc gene 
T0.RB7, whose expression ,s ,oeahzed ,o roo. nreris.en. and ,n,n,a.„re cen.ra, e,hnder 
regions (Yamamo.o (1991, />/a«CV.ff 3:371-382), 

, eaf-speeific promoters, such as the nbulose h.phosphate earboxylase (RBC S, 
promoters ean be used. For example, the .o^ato RBCSl, RBCS2 and RBCS3A genes are 
pissed ,n leaves and l.gh.-grown seedlings, on, RBCS, and RBCS2 are expressed m 
dlloprng .oma.0 fruits (Merer (1997, F,BS U.I. 4, 5:9,-95, A rihulose h.sp osphate 
L J.la e promoters expressed almos, exe.usivel, in ntesoph.ll eells .n leaf Hades and leaf 
ILL hi h levels, deser,hed .y Ma,su„.a (1994, rl.n. , 6:3,1-3,9, ean he used, 
t r ,ea. speetfie promo.er is ,he ligh. harvesting ehlorophyll a/h h.ndmg pro.e.n gen 
Ano.herlea, sp ^ ^^^^ 

U (1996", FBBSU,,. 379:1 17-,2,, is leaf-speeife. The A.myb5 promoter ,s expressed .n 
d op , .=af MeHontes, s,rpu.es, and epidermal ee„s on .he margins of young rosette a 

i ne le ves and ,n immature seeds. Atmvb5 mRN A appears between fert,l,.a.,on and the 
::::!:a;o'f embryodevelopmertt and persrstsbeyondtheheart stage. Aleafpro,™.er 

• u o ci. nqq?^ P/an? / n-1285-l295, can also be used, 
irlpntified in maize by Busk (IvV /K'""' " • 

Another elass of useful vegetative trssue-speerfie promoters are mer.stemat.e 
(root tip and shoot apex, promoters. For example, *e -SHOOTMEWSTEMLESS" and 
-SCAlLcROW promoters. v.hieh are aetive ,n the developing shoot or root ap.ca 
mertstl deseribed by Di Laurenzio (1996, Ce« 86:423-433; and. Long (1996) 
™ t69 ean be used. Another useful promoter Is that whieh eon.rols .he express.on of 
3.hydroxv.3. me.hylglutaryl coenzyme A reductase HM02 gene, whose express.on .s 
, e ,e.edtomeristematieandf,oral,seeretoryzoneofthes.igma.ma.urepo.^^^^^^^^^^^ 

gynoee,umvasculart,ssue.andfertilizedovn,es,tissues,see..g^Br^n 0^^^^ 

7:5,7-527,. Also useful are .„l-re.a,ed genes Irom marze °' " ^ 
meristem-speeife expression, see. e.g.. Granger ,1996, ,W Mo . """-^^^^ 

11 c \ <inn 1R87- Hake (1995) Fhilos. Trans. R. Sot. Land. u. oiut. 
Kerstetter(1994)? fl"K e//6:1877-188/, mKeuv-^ ^ 

, "!.3. Porexa.ple,the A.aHaop.s.haUanaKNATl promoter. In the shoot apex, 
KnIt transcnpt .s localized pnmanly to the shoot apical menstem; the — ion «f 

u • tPn. decreases during the floral transition and is restricted to the 

KNATl in the shoot meristem decreases during ,,,,.10,01 r76) 

cortex of the inflorescence stem (see. e.g., Lincoln i 
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0„e of skill will recoenize tha. a tissue-specinc promoter may drive 
express,o„ of operabW Imked .e,ue„ces m .issues o.„er than .he iargec ussuc. TTrus, as used 
:L a .issuelpecfre promo.er is one .har drives expression preferenrial, in *e rar.er 

tissue. b„. may also lead ,o some expression in other tissues as well. 

In another embodiment, a LEC 1 trueleie acid is expressed through a 

..artsposable element. Thts allows for constttutive, ye. periodte and infret,„e„. expression of 
r nstttuttve, active polypeptide. The tnvention also provides for use of ttssucspecftc 
promoters derived from vtruses whtch can include, e.g., the tobamovtrus subgenomtc 
, ,Kun,^^., 0995) Proc.N.,l Acad Sci. ™ 92:1679-1683; the rtce tungro 

rZler whtch drives strong phloem-speciftc reporter gene expression, the cassava ve.n 
Isate virus (CVMV) promoter, with highest activity in oscular elements. ,„ lea, mesophyl, 
cells, and ,n roo.ttps (Verdagucr (1996) Plan. Moi BU 31:1 129-1 139). 

Production of transgenic plants 

of the .nvemlon may be introduced into the genome of the 

aesired plant host by a variety of conventional techniques. For example, the DNA constntet 
m be „.rodue=d directly into the genomic DNA of the plan, eel, using teehtr.ues such as 
:toporatio„a„dm,eroi„ieetionofplan,ce,,pro.oplas.s,or.heO.Aco„st™ctscanb= 

i„.rodueed direc.ly to plant tissue using ballisttc methods, such as DNA part, le 
bombardtuent. Alternatively, the DNA constructs may be eombtned wtth suttabl T-DNA 
flawing regions and introduced into a conventional Agrobacteriuna tumefae.ens host vec.r^ 
TILL functions of the Agrobacterium tumefaciens host will direct .he mset^on of the 
Ilruct and adiacent marker into the plan, eel, DNA when .he cell is infec.ed by .he 

" M,croiniee.,on teehnmues are known tn the art and well desenbed m the 

,„ent,fte and patent Itterature. Tire tntroduction of 

glycol precipitatton is descrtbed in Paszkowski et al. En,.o ., y21^1-2n2 ( 984, 
Eleetroporatton techniques are described in .romm et P... »^ 
,0 <„85, Ballistic .ransfo^a.ion .echnit,ues are described in Kle.n e. al, 327.70-73 

Agrobacerrum tumefaciens-mediated transformation techniques, tncluding 
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example Horsch »/. .SV-V„« 233.496-498 0<.84,, and Praley « Proc N.i A..^ 

,e.h„,qucs .an be cultured .o regenerate a whole plan, which possesses ,he .ransfornred 

rioZe and .hus .he desired phe„o.,pe such as seedlessness. Such re.enera, on .echu.ues 

: ola„lpu,a„„n of c=r.ai„ phy.ohor.ones In a ..ssue cuhure .ow.h .e ... ..p.c.iy 

„, in, on a bioclde and/or herhrclde .arRer whrch has been ,n.roduced .oge.her w, h he 

d! e nuc,eo.,d= sequences. Plan, regeneration fro.. cnUured pro.oplas,s Is descr, e . 

Evans e. al Pro.op,as.s lso,a..o„ and Culture, Handbook of Plant Cell Culture, pp^ 14-1 76, 

Jllan PublLn. Conrpan. New Vor. >9S3; and Blndln. Regeneration o ants. 

•> ni CRC Press Boca Raton, 1 985. Regeneration can also be 
Plant Protoplasts, pp. 2,-73, CRC Pres. . 

obtained fron, plan, callus, explan.s, organs, or parrs .hereol. Su j 

are described generally in Klee e, „/. A., ^cv. «fPlu«, P>,ys- 38:467-486 (1987). 

The nucleic acds of ,he .nven.ion can he used .o confer desired .ra„s on 
essen.ially any plan.. Thus, the Invention has use over a broad range « 
species from the genera Asparagus, Afropa, Avena. Brass.ca, Ctrus, C.rullus. Ca tcum, 
C^mls CucurL Daucus. Fragaria, Glycine, Gossypiu., Helianrhus. Heterocall.s, 
HO I Hyoscyanrus, UCuca, Un„., Loliun., Lycopersicon, Mains. Manlhot, M.orana. 

NlconL, Oryza. Panieun,, Pannesetunr, Persea, Prsum, Pyrus, Prunus, Ra^^us, 
Secale Senecio, Srnap.s, Solanu., Sorghunr, Trigonella, Tnticum, V,.,s. Vrgna, and, Ze . 
're EC gene of .he inven..on are parilcularly useful in .he produClon of .ransgen.c plan.s 
I .he genu! Brasslca. Examples include broccoli, cauliflower, brussel sprouts, canola, and 
the like. 
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^ can be expressed together ^vUh other 

structural or regulatory ger.es to achieve a desired effect. A eel, or plan, s^h as a 
transformed cel. or a transgenic plant, can be transformed, engineered or bred .o^ -express 
both LECl nucleotide and/or LECl polypeptide, and another gene or gene product. 

T FC 1 nucleic acids can be co-expressed together m the same 
Alternatively, two or more LECl nucleic acius c 

plant or cell. ^ ^ ^^^^^^.^^^ ^^^^ ^^^^^^^^^ p^^^^ 

reproductive or vegetative tissue, can induce ectopic embryo morphogenesis. Thus, in one 
r:::!. . Jc l nuCic a^ of the invention is expressed ,n a sense conformation m a 
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transgenic plant to mduce the expression of ectopic embryo-like structures, as d.scussed 
above In another embodiment, LEG 1 is co-expressed with a gene or nucle.c ac.d that 
increases reproductive tissue mass, e.g., increases fruit size, seed mass, seed protem or seed 
oils For example, co-expression of ant.scnse nucle.c acid to ADC genes, such as AP2 and 
RAP^ <^enes of Arabidopsis, will dramatically increase seed mass, seed protein and seed o.ls; 
see e g Jofuku, a/., WO 98/07842; Okamuro (1997) Proc. Natl. Acad. Sa. USA 
94 '7076-7081; Okamuro (1997) Plant Cell 9:37-47; Jofuku (1994) Plant Cell 6:1211-1225. 
Thus co-expression of a LECl of the invention, to induce ectopic expression of embron.c 
cells and tissues, together with another plant nucleic acid and/or protein, such as the seed- 
mass enhancing antisense AP2 nucleic acid, generates a cell, tissue, or plant (e.g., a 
transgenic plant) with increased fruit and seed mass, greater yields of embryonic storage 

proteins, and the like. 

m another embodiment, the LECl nuclete acds of the invention are expressed 

in plant reproductive or vegetative cells and tissues which lack the ability to produce 
functional ADC genes, such as AP2 and RAP2 genes. The LECl nucleic acid can be 
expressed ,n an ADC "knocKouf transgentc plant. Alternative,,, the LEC, nucleic acd can 
be expressed in a cell, tissue or plant expresstng a mutant ADC nucleic acid or gene product. 
Expression of LEC, nucle.c acd in any of these non-function.n, ADC modeis will also 
produce a cell, fssue or plant wtth increased fruit and seed mass, greater yields of embryontc 
Storage proteins, and the Uke. 

One of skill wi„ recognize that after the expression cassette is stably 
mcorporated in transgenic plants ^d confirmed to be operable, it can be introduced into other 
plants by sexua, crossing. Any of a number of standard breedmg techniques can be used, 
; depending upon the species to be crossed. 
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Example 1 

This example describes the isolation and characterization of an exemplary 

LECl gene. 

F.x perimeptal Procedures 
Plant Material 

A lecl-2 mutant was identified from a population of Arabidopsis thaliana 
ecotype Wassilewskija (Ws-0) lines mutagemzed with T-DNA insertions as described before 
(West et al 1994). The abi3-3, fus3-3 and lecl-1 mutants were generously provided by 
Peter McCourt, University of Toronto and David Meinke, Oklahoma State University. Wild 
type plants and mutants were grown under constant light at 22"C. 

Double mutants were constructed by intercrossing the mutant lines lecl-1, 
lecl-2 abi3-3 fus3-3, and lec2. The genotype of the double mutants was verified through 
backcrosses with each parental line. Double mutants were those who failed to complement 
both parent lines. Homozygous single and double mutants were generated by germmatmg 
intact seeds or dissected mature embryos before desiccation on basal media. 
i.ol.tinn and Sequence anal ysis of Genomic and cDNA Clones 

Genomic libraries of Ws-O wild type plants, lecl-1 and lecl-2 mutants were 
made in GEMU vector according to the instructions of the manufacturer (Promega). Two 
silique-specific cDNA libraries (stages globular to heart and heart to young torpedo) were 

made in ZAPII vector (Stratagene). 

\he genomic library of lecl-2 was screened using right and left T-DNA 
specific probes according to standard techniques. About 12 clones that cosegregate with the 
mutation, were isolated and purified and the entire DNAs were further labeled and used as 
probes to screen a southern blot containing wild type and lecl-1 genomic DN A. One clone 
hybridized with plant DN A and was further analyzed. A7.1 kb Xhol fragment contaimng 
the left border and the plant sequence flanking the T-DNA was subcloned into 
pBluescript-KS plasmid (Stratagene) to form ML7 and sequenced using a left border specific 
primer (5' GCATAGATGCACTCGAAATCAGCC 3'). The T-DNA organization was 
partially verified using southern analysis with T-DNA left and right borders and PBR322 
probes The results suggested that the other end of the T-DNA is also composed of left 
border This was confirmed by generating a PC'R fragment using a genomic plant DNA 
primer (LP primer 5' GCT CTA GAC ATA C AA C AC TTT TCC TTA 3') and a T-DNA left 
border specific primer (5' GCTTGGTAATAATTGTCATTAG 3') and sequencing. 
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\he KcoRl insert ol-ML7 was used to screen a wild type genomic library. 
Two overlappn.g clones were purilled and a 7.4 EcoRl genomic fragment from the w.ld type 
DNA region was subcloned mto pBluescript-KS plasm.d making WT74. Th,s fragmem was 
sequenced (SEQ ID NO: 4) and was used to screen lecl-1 genomic library and w.ld type 
5 silique-speafic cDNA libraries. 8 clones from the lecl-1 genomic Hbrary were identified 

and analyzed by restriction fi^apping. 

from these clones the exact site of the deletion in lecl-1 was mapped and 
sequenced by amplifying a Xbp PGR fragment using primers (H21 - 5' H21 - 5' CTA AAA 
ACA TCT ACG GTT CA 3'; H 17 - 5' TTT GTG GTT GAC CGT TTG GC 3') flanking the 
10 deletion region m lecl-1 genomic DNA. Clones were isolated from both cDNA libraries 
and partially sequenced. The sequence of the cDNA clones and the wild type genomic clone 
n^atched exactly, confirming that both derived from the same locus. All hybridizations were 
performed under stringent conditions with 32P random prime probes (Stratagene). 

Sequencing was done using the automated dideoxy chain termination method 
1 5 (Applied Biosystems, Foster City, CA). Data base searches were performed at the National 
Center for Biotechnology Information by using the BLAST network service. Alignment of 
protein sequences was done using PILEUP program (Genetics Computer Group, Madison, 
WI) 

nNA and RNA blot analysis 
20 Genomic DNA was isolated from leaves by using the CTAB-containing buffer 

Dellaporta, et al., (1983). Plant Mol. Biol. Reporter 1:19-21. Two micrograms of DNA was 
digested with different restriction endonucleases, electrophoretically separated in 1% agarose 
gel and transferred to a nylon membrane (Hybond N; Amersham). 

Total RNA was prepared from siliques, two days old seedlings, stems, leaves, 
25 buds and roots. Poly(A)+ RNA was purified from total RNA by oligo(dT) cellulose 

chromatography, and two micrograms of each Poly(A). RNA samples were separated m l^/o 
denatured formaldehyde-agarose gel. Hybridizations were done under stringent conditions 
unless it specifies otherwise. Radioactive probes were prepared as described above. 



rom plementation of le d mutants 

A 3.4 kb Bslyl fragment of genomic DNA (SHQ ID NO: 3) containing 
sequences from 1.992 kb upstream of the ORF to a region 579 bp downstream from the poly 
A site was subcloned into the hygromycin resistant binary vector pBIB-Hyg. The LFXl 
cDNA was placed under the control of the 35S promoter and the ocs polyadenylation signals 
by msertmg a PCR fragment spanning the entire coding region into the plasmid pARTV. The 
entire regulatory fragment was then removed by digestion with Notl and transferred into the 
hygromycin resistant binary vector BJ49. The binary vectors were introduced into the 
Agrobacterium strain GV3101, and constructions were checked by re-isolation of the 
plasmids and restriction enzyme mapping, or by PCR. Transformation to homozygous lecl-1 
and lecl-2 mutants were done using the in planta transformation procedure (Bechtold, et al., 
(1993) Comptes Rendus de f Academic des Sciences Serie 111 Sciences de la Vie, 316: 
1 194-1 199 Dry seeds from led mutants were selected for transformants by their ability to 
germinate after desiccation on plates containing 5g/ml hygromycin. The transformed plants 
were tested for the present of the transgene by PCR and by screening the siliques for the 
present of viable seeds. 
In ^\iu Hvhridization 

Experiments were performed as described previously by Dietrich et al. (1989) 

Plant Cell 1 : 73-80. Sections were hybridized with LECl antisense probe. As a negative 
control the LECl antisense probe was hybridized to seed sections of led mutants. In 
addition, a sense probe was prepared and reacted with the wild type seed sections. 



Results 

^..„.,;. ln,e,.r.iion P----" ' '"f- Colvl.-'inn-TYPe Mutants and abi3 

In order to understand the genetic pathways which regulate late embryogenests 
we took advantage of three Arabidopsis ntutants lec2. fus3-3 and abi3-3 that cause similar 
defects in late embryogenesis to those of lecl-1 or lecl-2. These mutants are desiccat.on 
rntolerant. sometimes vtviparous and have activated shoot apical meristems. The lee2 and 
fus3-3 mutants are sens.live ,c AB A and possess trichomcs on their cotyledons and therefore 
can be categorized as leafy eotyledon-type mutants (Meinke et al.. 1994). The ab,3-3 
mutants belong to a different class of late embryo defective mumtions that is insenstttve to 
ABA and docs not have trichomcs on the cotyledons. 

The two classes of mutants were crossed to lccl-1 and lecl-2 mutants to 
construct plants homozygous to both mutations. The led and Iec2 mutations tnteract 
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synergistically, resulting in a double mutant which is arrested in a stage similar to the late 
heart stage, the double mutant embryo, however, is larger. The lecl or lec2 and fus3-3 
double mutants did not display any ep.stasis and the resulting embryo had an intermediate 
phenotype. The lecl/abi3-3 double mutams and lec2/abi3-3 double mutants were ABA 
insensitive and had a lee-like phenotype. There was no different between double mutants that 

consist of either lecl-1 or lecl -2. 

No epistasis was seen between the double mutants indicating that each of the 
above genes, the LEC-type and ABI3 genes, operate in different genetic pathways. 
T.RCl Functions Rarlv i n Fmhrvoeenesis 

The effects of lecl is not limited to late embryogenesis, it also has a role in 
early embryogenesis. The embryos of the lecl/lec2 double mutants were arrested in the early 
stages of development, while the single mutants developed into mature embryos, suggesting 
that these genes act early during development. 

Further examination of the early stages of the single and double mutations 
showed defects in the shape, size and cell division pattern of the mutants suspensors. The 
suspensor of wild type embryo consists of a single file of six to eight cells, whereas the 
suspensors of the mutants are often enlarged and undergo periclinal divisions. Leafy 
cotyledon mutants exhibit suspensor anomalies at the globular or transition stage whereas 
wild type and abi3 mutant do not show any abnormalities. 

The number of anomalous suspensors increases as the embryos continue to 
develop. At the torpedo stage, the wild type suspensor cells undergo programmed cell death, 
but in the mutants secondary embryos often develop from the abnormal suspensors and, when 
rescued, give rise to twins. 

Twv..n;..tinn of the LE n T orns in Wild Type Plants and lecl Mutants 

Two mutant alleles of the LECl gene have been reported, lecl-1 and lecl-2 
(Meinke 1992' West et al., 1994). Both mutants were derived from a population of plants 
mutagemzed insertionally with T-DNA (Feldmann and Marks, 1987), although lecl-1 is not 
tagged. The lecl-2 mutant contains multiple T-DNA insertions. A specific subset of T-DNA 
fragments were found to be closely linked with the mutation. A genomic library of lecl-2 
, was screened using right and left borders T-DNA as probes. Genomic clones containing 
T-DNA fragments that cosegregate with the mutation were isolated and tested on Southern 
blots of both wild type and lecl-1 plants. Only one clone hybridized with Arabidopsis DNA 
and also gave polymorphic restricfion fragment in lecl-1 . 
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The lecl-1 polymorphism resulted from a small deletion, approximately 2 kb 
in length. Using sequences from the plant fragment flanking the T-DNA, the genomic w.ld 
type DNA clones and the lecl-1 genomic clones were isolated. An EcoRl fragment of 7.4 kb 
of the genomic wild type DNA that corresponded to the polymorphic restriction fragment in 
5 lec 1 - 1 was further analyzed and sequenced. The exact site of the deletion in lec 1 - 1 was 

identified using a PGR fragment that was generated by primers, within the expected borders 
of the deleted fragment, and sequencing. 

In the wild type genomic DNA that corresponded to the lecl-1 deletion, a 626 
bp ORF was identified. Southern analysis of wild type DNA and the two mutants DNA 
10 probed with the short DNA fragment of the ORF revealed that both the wild type and lecl-2 
DNA contain the ORF while the lecl-1 genomic DNA did not hybridize. The exact insertion 
site of the T-DNA in lecl-2 mutant was determined by PGR and sequencing and it was found 
that the T-DNA was inserted 1 1 5 bp upstream of the ORF's translational initiation codon in 

the 5' region of the gene. 
j3 At the site of the T-DNA insertion a small deletion of 21 plant nucleic acids 

and addition of 20 unknown nucleic acids occurred. These results suggest that in lecl-2 the 
T-DNA interferes with the regulation of the ORF while in lecl-1 the whole gene is deleted. 
Thus, both led alleles contain DNA disruptions at the same locus, confirming the identity of 
the LECl locus. 

20 The led Mutants C^n Re Com plement hv Transformation 

To prove that the 7.4 kb genomic wild type fragment indeed contained the 
ORF of the LECl gene, we used a genomic fragment of 3395 bp (SEQ ID NO: 3) within that 
fragment to transform homozygous Iecl-1 and lecl-2 plants. The clone consists of a 3395 
bp BstYI restriction fragment containing the gene and the promoter region. The translation 

25 start codon (ATG) of the polypeptide is at 1999 and the stop codon is at 2625 (TGA). There 

are no introns in the gene. 

The transformed plants were selected on hygromycin plates and were tested to 
contain the wild type DNA fragment by PGR analysis. Both transgenic mutants were able to 
produce viable progeny, that were desiccation tolerant and did not posses trichomes on their 
30 cotyledons. We concluded that the 3.4 kb fragment can complement the led mutation and 
since there is only one ORF in the deleted 2 kb fragment in lecl-l we suggest that this ORF 
corresponds to the LEGl gene. 
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The LECl Gene is a Member of G ene Family 

In order to isolate the LECl gene two cDNA libraries of young siliques were 
screened using the 7.4 kb DNA fragment as a probe. Seventeen clones were isolated and after 
further analysis and partial sequencing they were all found to be identical to the genomic 
ORF. The cDNA contains 626 bp ORE specifying 208 amino acid protein (SEQ ID NO:l 

and SEQ ID NO:2). 

The EECl cDNA was used to hybridize a DNA gel blot containing Ws-O 
genomic DNA digested with three different restriction enzymes. Using low stringency 
hybridization we found that there is at least one more gene. This confirmed our finding of 
two more Arabidopsis ESTs that show homology to the LECl gene. 
The LECl pene is Emb rvo Specific 

The led mutants are affected mostly during embryogenesis. Rescued mutants 
can give rise to homozygous plants that have no obvious abnormalities other than the 
presence of trichomes on their cotyledons and their production of defective progeny. 
Therefore, we expected the LECl gene to have a role mainly during embryogenesis and not 
during vegetative growth. To test this assumption poly (A)+ RNA was isolated from siliques, 
seedling, roots, leaves, stems and buds of wild type plants and from siliques of led plants. 
Only one band was detected on northern blots using either the EEC 1 gene as a probe or the 
7.4 kb genomic DNA fragment suggesting that there is only one gene in the genomic DNA 
fragment which is active transcriptionally. The transcript was detected only in siliques 
containing young and mature embryos and was not detected in seedlings, roots, leaves, stems 
and buds indicating that the LECl gene is indeed embryo specific. In addition, no RNA was 
detected in siliques of both alleles of led mutants confirming that this ORF corresponds to 
the LECl gene. 

Expression Pattern o f the LECl Gene 

To study how the LECl gene specifies cotyledons identity, we analyzed its 
expression by in situ hybridization. We specifically focused on young developing embr>'os 
since the mutants abnormal suspensors phenotype indicates that the LECl gene should be 
active very early during development. 

During embryogenesis, the LECl transcript was first detected in proglobular 
embryos. The transcript was found in all cells of the proembryo and was also found in the 
suspensor and the endosperm. However, from the globular stage and on it accumulates more 
in the outer layer of the embryo, namely the protoderm and in the outer part of the ground 
meristem leaving the procambium without a signal. At the torpedo stage the signal was 
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Stronger in the cotyledons and the root meristem, and was more Hmited to the protoderm 
layer. At the bent cotyledon stage the signal was present throughout the embryo and at the 
last stage of development when the embryo is mature and filling the whole seed we could not 
detect the LECl transcript. This might be due to sensitivity limitation and may imply that if 
the LECl transcript is expressed at that stage it is not localized in the mature embryo, but 
rather spread throughout the embryo. 

The LECl gene encodes a Homolog of CCAAT binding factor. 

Comparison of the deduced amino acid sequence of LECl to the GenBank 
reveals significarH similarity to a subunit of a transcription factor, the CCAAT box binding 
factor (CBF). CBP^ are highly conserved family of transcription factors that regulate gene 
activity in eukaryotic organisms Mantvani, et ah, . (1992). Nucl. Acids Res. 20: 1087-1091 . 
They are hetero-oligomeric proteins that consist of between three to four non-homologous 
subunits. LECl was found to have high similarity to CBF-A subunit. This subunit has three 
domains; A and C which show no conservation between kingdoms and a central domain, B, 
which is highly conserved evolutionary. Similarly the LECl gene is composed of three 
domains. The LECl B domain shares between 75%-85% similarity and 55%-63% identity 
with different B domains that are found in organisms ranging from yeast to human. Within 
this central domain, two highly conserved amino acid segments are present. Deletion and 
mutagenesis analysis in the CBF-A yeast homolog hap3 protein demonstrated that a short 
region of seven residues (42-48) (LPIANVA) is required for binding the CCAAT box, while 
the subunit interaction domain lies in the region between residues 69-80 (MQECVSEFISFV) 
(Xing et al., supra). LECl protein shares high homology to those regions. 

DISCUSSION 

The led mutant belongs to the leafy cotyledon class that interferes mainly 
with the embryo program and therefore is thought to play a central regulatory role during 
embryo development. It was shown before that LECl gene activity is required to suppress 
germination during the maturation stage. Therefore, we analyzed the genetic interaction of 
homozygous double mutants of the different members of the leafy cotyledon class and the 
abi3 mutant that has an important role during embr>^o maturation. All the five different 
combinations of the double mutants showed either an intermediate phenotype or an additive 
effect. No epistatic relationship among the four genes was found. These findings suggest 
that the different genes act in parallel genetic pathways. Of special interest was the double 
mutant lecl/lec2 that was arrested morphologically at the heart stage, but continued to grow 
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in that shape. This double mutant phenotype indicates that both genes LECl and LEC2 are 
essential for early morphogenesis and their products may interact directly or indirectly in the 
young developing embryo. 
The Role of LECl in Embrvouenesis 

One of the proteins that mediate CCA AT box function, is an heteromeric 
protein called CBF (also called NFY or CPl). CBF is a transcription activator that regulates 
constitutively expressed genes, but also participates in differential activation of 
developmental genes Wingender, E. (1993). Gene Regulation in Eukaryotes (New York: 
VCH Publishers). In mammalian cells, three subunits have been identified CBF-A, CBF-B 
and CBF-C and all of which are required for DNA binding. In yeast, the CBF homolog HAP 
activates the CYCl and other genes involved in the mitochondrial electron transport Johnson, 
et al.. Proteins. Annu. Rev. Biochem. 58, 799-840. (1989). HAP consists of four subunits 
hap2, hap3, hap4 and hap5. Only hap2, 3 and 5 are required for DNA binding. CBF-A, B 
and C show high similarity to the yeast hap3, 2 and 5, respectively. It was also reported that 
mammalian CBF-A and B can be functionally interchangeable with the corresponding yeast 

subunits (Sinha et al., supra.). 

The LEC l gene encodes a protein that shows more then 75% similarity to the 
conserved region of CBF-A. CCAAT motifs are not common in plants' promoters and their 
role in transcription regulation is not clear. However, maize and Brassica homologs have 
been identified. A search of the GenBank revealed several Arabidopsis ESTs that show high 
similarity to CBF-A, B and C. Accession numbers of CBF-A (HAP3) homologs: H37368, 
H76589; CBF-B (HAP2) homologs: T20769; CBF-C (HAP5) homologs: T43909, T44300. 
These findings and the pleiotropic affects of LECl suggest that LECl is a member of a 
heteromeric complex that functions as a transcription factor. 

The model suggests that LECl acts as transcription activator to several sets of 
genes, which keep the embryonic program on and repress the germination process. 
Defective LECl expression partially shuts down the embryonic program and as a result the 
cotyledons lose their embryonic characteristics and the germination program is active in the 
embryo. 



Example 2 

This example demonstrates that LECl is sufficient to induce embryonic 
pathways in transgenic plants. 
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The phenotype of led mutants and the gene's expression pattern indicated 
that LECl functions specifically during embryogenesis. A LFICI cDNA clone under the 
control of the cauliflower mosaic virus 35S promoter was transferred into lecl-1 mutant 
plants in planta using standard methods as described above. 
5 Viable dry seeds were obtained from lecl-1 mutants transformed wath the 

35S/LEC1 construct. However, the transformation efficiency w^as only approximately 0.6% 
of that obtained normally. In several experiments, half the seeds that germinated (12/23) 
produced seedlings with an abnormal morphology. Unlike wild type seedlings, these 
35S/LEC1 seedlings possessed cotyledons that remained fleshy and that failed to expand. 
1 0 Roots often did not extend or extended abnormally and sometimes greened. These seedlings 
occasionally produced a single pair of organs on the shoot apex at the position normally 
occupied by leaves. Unlike wild type leaves, these organs did not expand and did not possess 
trichomes. Morphologically, these leaf-like structures more closely resembled embryonic 
cotyledons than leaves. 

15 The other 35S/LEC1 seeds that remained viable after drying produced plants 

that grow vegetatively. The majority of these plants (7) flowered and produced 100% led 
mutant seeds. Amplification experiments confirmed that the seedlings contained the 
transgene, suggesting that the 35S/LEC1 gene was inactive in these T2 seeds. No vegetafive 
abnormalities were observed in these plants with the exception that a few displayed defects in 

20 apical dominance. A few plants (2) were male sterile and did not produce progeny. One 
plant that produced progeny segregated 25^/o mutant Led" seeds that, when germinated 
before desiccation and grown to maturity, gave rise to 100% mutant seed, as expected for a 
single transgene locus. The other 75% of seeds contained embryos with either a wild type 
phenotype or a phenotype intermediate between led mutants and wild type. Only 25% of the 

25 dry seed from this plant germinated, and all seedlings resembled the embryo-like seedlings 
described above. Some seedlings confinued to grow and displayed a striking phenotype. 
These 35S/LEC1 plants developed two types of structures on leaves. One type resembled 
embryonic cotyledons while the other looked like intact torpedo stage embryos. Thus, 
ectopic expression of LECl induces the morphogenesis phase of embryo development in 

30 vegetative cells. 

Because many 35S/LEC1 seedlings exhibited embryonic characteristics, the 
seedlings were analyzed for expression of genes specifically acfive in embryos. Cruciferin A 
storage protein mRNA accumulated throughout the 35S/LEC1 seedlings, including the leaf- 
like structures. Proteins with sizes characteristic of 12S storage protein cruciferin 
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accumulated in these transgenic seedlings. Thus, 35S/LEC1 seedings displaying an embryo- 
like phenotype accumulated embryo-specitlc mRNAs and proteins. LECl mRNA 
accumulated to a high level in these 35S/LEC1 seedlings in a pattern similar to early stage 
embryos but not in wild type seedlings. LF.Cl is therefore sufficient to alter the fate of 
vegetative cells by inducing embryonic programs of development. 

The ability of LECl to induce embryonic programs of development in 
vegetative cells establishes the gene as a central regulator of embryogenesis. LECl is 
sufficient to induce both the seed maturation pathway as indicated by the induction of storage 
protein genes in the 35S/LEC1 seedlings. The presence of ectopic embryos on leaf surfaces 
and cotyledons at the position of leaves also shows that LECl can activate the embryo 
morphogenesis pathway. Thus, LECl regulates both early and late embryonic processes. 

Example 3 

This example shows that LECl is expressed in zygotes and that the promoters 
of the invention can therefore be used to target expression in zygotes. 

To determine precisely when the LECl gene becomes activated, LECl RNA 
levels were analyzed in the egg apparatus of mature female gametophytes before fertilization, 
in zygotes after fertilization, and in very early stage embryos containing an apical cell and 
two to three suspensor cells. In situ hybridization experiments showed that LECl RNA was 
present in zygotes and early stage embryos but was not detected in female gametophytes. 
These resuhs show that the LECl promoter becomes active in the zygote. The LECl is 
therefore useful to target the expression of sense or antisense versions of regulatory genes or 
cytotoxic genes to zygotes and early stage embryos. 

Example 4 

This example shows the identification of a LECl homolog from Arabidopsis 
designated the LEAFY COTYLEDON 1 -LIKE gene. 

A Blast search was conducted through the Arabidopsis Database 
(http:/,/genome-www.stanford.edu/Arabidopsis/) using the LECl cDNA nucleotide sequence 
as a probe to identify homologs of the HAP3 subunit of CCAAT box binding transcription 
factor from Arabidopsis. The Arabidopsis BAC clone, MN.I7 (Accession Number 
AB025628), contains a gene, designated LECl-Like (LIL). that displays the highest amino 
acid sequence identity with the LECl protein of any known Arabidopsis HAPS gene. The 
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deotide and amino acid sequences of I/L are shown in SEQ ID NO:19 and SEQ ID 

NO:20. respectively. 

The Polymerase Chain Reaction (PGR) was used to amplify the LIL gene, 
which lacks introns. Pnmers designed to amplify the LIL open reading Irame contained 
BamHl and XbaErestnction enzyme sites for cloning purposes. The forward primer, 
BAMMNJ7-5 seqivence .s 5'-AGGAICCATGGAACGTGGAGGCTTCCAT-3' with the 
BamHI site underlirt^d. The reverse pnmer, 3-MNJ7XBA sequence is 5'- 
A TCTAGA TCAGTACTTATGTTGTTGAGTCG-3- with the Xbal site underlined. The 
PGR conditions were as follows: 30 cycles of 45 seconds at 94°G, 45 seconds at 53°G, and 3 
minutes at 72°G. AmphtTaq DNA polymerase (Perkin Elmer Cetus, 761 Main Ave., 
Norwalk, GT 06859) was^ged. PGR products were cloned using the TOPO TA Gloning Kit 
(Invitrogen, Garlsbad, GA 92008). The nucleotide sequence of the cloned LIL gene was 
determined to confirm its idenXity. 
Accumulation of Lp rJ-LIKE Rt^A 

The LIL clone was hybridized with gel blots containing 20 ^ig of total RNA 
from leaves, stems, roots, seedlings, and siliques containing either proembryo to heart stage 
(early) embryos, heart to torpedo stage (middle) embryos, or torpedo to mature (late) 
embryos / / L RNA was detected only in siliques containing all three stages of embryos. 
Detection of the LIL RNA in siliques from lecl-1 mutants showed that the RNA detected 
was not LECl. Thus, like LECl, LIL accumulates specifically during embryogenesis. 
Compleme ntation of lec'-' Mut^^tion bv LIL 

The LIL clone was inserted into the LECl promoter/terminator cassette within 
the plant transformation vector BJ49. The LECl promoter/terminator cassette consists of 
1992 bp of DNA 5' of the LECl translation start codon and 770 bp 3' of the LECl cDNA 
translation stop codon (H.S. Lee, R.W. Kwong, and J.J. Harada, unpublished results). The 
promoter and terminator are separated by a short polylinker with Bglll and Avrll restriction 
endonuclease sites in which the LIL gene was inserted. 

This construct was transferred into homozygous lecl-1 null mutants using in 
planta transformation procedures with Agrobacterium nmefaciens strain GV3 1 01 . Unlike 
) lecl-1 mutant plants whose progeny die following desiccation, plants transformed with the 
LIL construct produced viable seedlings. PGR amplification experiments confirmed that the 
viable seedlings have the lecl-1 mutation and the transgene. Seedlings morphologically 
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resembled «nd type ra.her ,ha„ /«■/ mu,an. plants. These results show that the UL gene 
contplements the mutation, suggesting overlapping functions for the two genes. 



Example 5 

This example shows the identifieation of a LfiC/ ortholog from scarlet runner bean. 

^ A CDNA library was constructed wtth 1 50 ng of total RNA isolated from 

embryo propers (EP) of dre scarlet runner bean (SRB; PHaseolus coccineus, that were 
dissected from globu.ar-stage embryo. The SMART PCR cDN A Ltbrary Constructton Ktt 
(Clonteeh cat # K1051-1 ) was used according to a manufacturer's protocol. Bnefly. first 
strand cDNA was synthes.zed from EP total RNA using Superscript I, RNase H- reverse 
transcriptase (Gibeo/BRL. cat # 18064-014, ,n the presence of an Sf, IB-s.te contamtng 
oligo-dT primer ,CDSIII/3' PCR primer, Clonteeh, and a SMART 11, contain.ng an Sfi lA- 
site primer (Clonteeh). Second strand was generated by polymerase chain reaction ustng - 

u branded cDN A was digested with Sfi 1 restrtction 

and 3' PCR primers (Clonteeh,. Double-stranded CDNA wa g ^„„„„. >, 

enzyme (New England BioLabs, ca, # 123S, and then size-fractionated over a CHROMA S- 

400 sepharose column (Clonteeh). A«er analyzing collected fractions ona ,.,% agarose ge 

four fractons containing htgh amount of cDN As in a range of 0.5 kb to 4 Ub were pooled and 

prectpitated in an ethanol/sal. solution at .20=C overnight. A cDNA pellet was recove^d y 

eentrifugation and resuspended in 7 „L of sterile water. cDNA inserts were Itgated to Sft I- 

digested lambda ar^s (lTriplEx2, Clonteeh). Ligation mixtures were packaged tnto phage 

heads using Gigapack III Gold Packaging Extract (Stratagene). 

U^dM^^U..:^^ fonn to a plasmid form vta 

Cre-Lox systen, (tn vivo excision. Clonteeh). Colonies were pteked randomly for p^trrtd 
DNA isolation. T^e nucleotide sequences of cDNA clones were determtned ustng BtgDye 
terminator, a 5'-TrtplEx sequencing primer (Clonteeh). and the AB, Prism 377 DNA 
sequencer (Perktn-Elmer Applied Biosystems,. The identity of the eDNA clone was 
determined by BlastX and BlastN analyses. 

A BlastX search revealed that a cDN A clone pPCEPl 1 2 encoded a protetn 
(SEO ID NO-22, with high amtno acid sequence identity to the Arabidopsis LECI. espeetally 
. „, the conserved B domain. However, a BlastN result tndtcated that thts SRB cDNA 
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scuenc. of *c pPCEP, ,2 insert was d..er™„cd .0 b= 988 bp (SEO ID N0:2, ,. 

e.b„os, we carried ou. in si.u hybrid,za.,o„ analyses. Tbe fuli ieng.h cDNA inse„ of 
BSEP, U was used as .He .enrpia.e fo, sense and a„„sense RNA probe s.n.bes.s. ^e 
In A accumulared ,n bo.i, *e e.bryo proper (EP, and snspensor (S, o, a 5 days after 
:r,Le.bryo.in.be.days*rpo,,ina.ionseeds..e^Ais,ocan.^^^^^^^^^^^ 

L ep,der.a, layer of .he embryo proper and nrodera.eiy in every ce I ,n bo.b .h embryo 
prop r and snspensor. Oniy background signai was de.eCed usrng .he sense probe. ,n 
Trllion, .h! spa.ia. expression pa.,ern of SRB LECi.e gene in .he .iobuiar en,bryo ,s 
similar to that of Arabidopsis LEC 1 . 

The above examples are provided .o illus.ra,e .he inven.ion bu. no. .o l,m.. i.s 
,eope Cher varian.s of .he invenUon «iil be read.ly apparent .o one of ordinal skiU m the 
: „dareencompassedby.heappendedclaims. A,ip.,blica..ons da.bases Oen^^^^^^^^ 

sequences, patents, and paten, apphcations Ced herein are hereby .ncorpora.ed by 



